2331/130 

Attorney Docket: 2331/130 

Systems and Methods for Characterizing a Biological Condition or Agent Using 
Selected Gene Expression Profiles 
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10 60/141,542, filed June 28, 1999 and provisional application serial number 60/195,522 filed 
April 7, 2000. These related applications are hereby incorporated herein by reference. 

Technical Field 

Embodiments of the recent invention provide systems and methods utilizing gene 
expression analysis for characterizing a biological condition or agent. 

15 Background Art 

There has been substantial discussion including congressional hearings concerning 
medical errors. One source of medical errors includes errors with medications. Upwards of 
98,000 hospitalized patients annually have been documented to be victims of medication 
errors (Statement of the American Pharmaceutical Association to the Senate Appropriations 

20 Committee Labor, health and Human Services Education Subcommittee Hearing on Medical 
Errors December 13, 1999). These errors include problems arising from drug interactions for 
a particular patient taking more than one drug, problems concerning the response of an 
individual to a particular drug and incorrect medication for a particular condition. Medical 
errors further arise as a result of misdiagnosis. This may occur as a result of insensitive 

25 diagnostic techniques or a wide range of interpersonal variability in the manner in which a 

clinical state is manifest. At present, there are few tools available for optimizing prognosis, 

diagnosis and treatment of a medical condition taking into account the particular phenotype 

and genotype of an individual. 

There has been increasing interest in herbal drugs or nutraceuticals. These 

30 compounds are grown and collected from around the world, and consequently the 
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compounds are subject to regional and temporal differences in collection and preparation that 
are difficult to control. It is frequently the case that one batch of a nutraceutical may be 
effective, there is no assurance that a second batch will be effective. Moreover, analysis of 
nutraceuticals is problematic because these drugs are complex mixtures in which little is 
5 known with respect to the active agent. 

All new therapeutic agents require some form of clinical trials. It is known that a 
drug for treating tumor that is tested in a clinical trial using standard recruiting techniques for 
patients, may in fact show only limited efficacy. If the beneficial effect observed in a clinical 
population is too small, the drug will not receive approval by the Food and Drug 
10 Administration for use in the population at large. However, the small beneficial effect 

observed may in fact be an artifact of the clinical trial design or the clinical endpoint in the 
population of patients. It would be desirable to have criteria for screening patients as they 
enter a clinical trial to ensure that the beneficial effect of a drug if it exists may be detected 
and quantified. 

15 Summary of the Invention 

In a first embodiment of the invention there is provided a method, for evaluating a 
biological condition of a subject, that includes: obtaining from the subject a sample having 
at least one of RNAs and proteins; deriving from the sample a first profile data set, the first 
profile data set including a plurality of members, each member being a quantitative measure 

20 of the amount of a distinct RNA or protein constituent in a panel of constituents selected so 
that measurement of the constituents enables measurement of the biological condition; and 
producing a calibrated profile data set for the panel, wherein each member of the calibrated 
profile data set is a function of a corresponding member of the first profile data set and a 
corresponding member of a baseline profile data set for the panel, the calibrated profile data 

25 set providing a measure of the biological condition of the subject. 

In another embodiment, a method is provided for evaluating a biological condition of 
a subject, that includes obtaining from the subject a first sample having at least one of fluid, 
cells and active agents; applying the first sample or a portion thereof to a defined population 
of indicator cells; obtaining from the indicator cells a second sample containing at least one 

30 of RNAs or proteins; deriving from the second sample a first profile data set, the first profile 
data set including a plurality of members, each member being a quantitative measure of the 
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amount of a distinct RNA or protein constituent in a panel of constituents selected so that 
measurement of the constituents enables measurement of the biological condition; and 
producing a calibrated profile data set for the panel, wherein each member of the calibrated 
profile data set is a function of a corresponding member of the first profile data set and a 
5 corresponding member of a baseline profile data set for the panel, the calibrated profile data 
set providing a measure of the biological condition of the subject. 

In a another embodiment, a method is provided for evaluating a biological condition 
affected by an agent, the method including: obtaining, from a target population of cells to 
which the agent has been administered, a sample having at least one of RNAs and proteins; 

10 deriving from the sample a first profile data set, the first profile data set including a plurality 
of members, each member being a quantitative measure of the amount of a distinct RNA or 
protein constituent in a panel of constituents selected so that measurement of the constituents 
enables measurement of the biological condition; and producing a calibrated profile data set 
for the panel, wherein each member of the calibrated profile data set is a function of a 

15 corresponding member of the first profile data set and a corresponding member of a baseline 
profile data set for the panel, the calibrated profile data set providing a measure of the 
biological condition as affected by the agent. 

In a another embodiment, a method is provided for evaluating the effect on a 
biological condition by a first agent in relation to the effect by a second agent, including: 

20 obtaining, from first and second target populations of cells to which the first and second 
agents have been respectively administered, first and second samples respectively, each 
sample having at least one of RNAs and proteins; deriving from the first sample a first 
profile data set and from the second sample a second profile data set, the profile data sets 
each including a plurality of members, each member being a quantitative measure of the 

25 amount of a distinct RNA or protein constituent in a panel of constituents selected so that 
measurement of the constituents enables measurement of the biological condition; and 
producing for the panel a first calibrated profile data set and a second profile data set, 
wherein (i) each member of the first calibrated profile data set is a function of a 
corresponding member of the first profile data set and a corresponding member of a first 

30 baseline profile data set for the panel, and (ii) each member of the second calibrated profile 
data set is a function of a corresponding member of the second profile data set and a 
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corresponding member of a second baseline profile data set for the panel, the calibrated 
profile data sets providing a measure of the effect by the first agent on the biological 
condition in relation to the effect by the second agent. 

In a further embodiment, a method of conducting a clinical trial of an agent, is 
5 provided, including: causing the blind administration of a selected one of a placebo and the 
agent to each candidate of a pool of subjects; and using quantitative gene expression to 
monitor an effect of such administration. 

In another embodiment, a digital storage medium is provided on which is stored a 
computer readable calibrated profile data set, wherein: the calibrated profile data set relates 

10 to a sample having at least one of RNAs and proteins derived from a target cell population to 
which an agent has been administered; the calibrated profile data set includes a first plurality 
of members, each member being a quantitative measure of a change in an amount of a 
distinct RNA or protein constituent in a panel of constituents selected so that measurement of 
the constituents enables measurement of a biological condition as affected by administration 

15 of the agent. 

In another embodiment, a digital storage medium is provided on which is stored a 
plurality of records Ri relating to a population of subjects, each record /?/ corresponding to a 
distinct instance P, of a computer readable profile data set P wherein: each instance Pi of the 
profile data set P relates to a distinct sample derived from a subject, the sample having at 

20 least one of RNAs and proteins; the profile data P set includes a plurality of members Mj, 
each member Mj being a quantitative measure of the amount of a distinct RNA or protein 
constituent in a panel of constituents selected so that measurement of the constituents enables 
measurement of a biological condition; each record Ri includes, for each member My of a 
corresponding distinct instance Pi of the profile data set P, a value corresponding to the value 

25 of the member Af,y; and each record Ri also includes a reference to a characteristic of the 
subject relative to the record, the characteristic being at least one of age group, gender, 
ethnicity, geographic location, diet, medical disorder, clinical indicator, medication, physical 
activity, body mass, and environmental exposure. 

In a further embodiment, a digital storage medium is provided on which is stored a 

30 large number of computer readable profile data sets, wherein each profile data set relates to a 
sample derived from a target cell population to which has been administered an agent, the 
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sample having at least one of RNAs and proteins; each profile data set includes a plurality of 
members, each member being a quantitative measure of the amount of a distinct RNA or 
protein constituent in a panel of constituents selected so that measurement of the constituents 
enables measurement of a biological condition; and the panel is the same for all profile data 
5 sets. 

In a another embodiment of the invention, a method is provided for evaluating a 
biological condition of a subject, based on a sample from the subject, the sample having at 
least one of RNAs and proteins, the method including: deriving from the sample a first 
instance of a profile data set, the profile data set including a plurality of members, each 

10 member being a quantitative measure of the amount of a distinct RNA or protein constituent 
in a panel of constituents selected so that measurement of the constituents enables 
measurement of the biological condition; and producing a first instance of a calibrated profile 
data set for the panel, wherein each member of an instance of the calibrated profile data set is 
a function of a corresponding member of an instance of the profile data set and a 

15 corresponding member of an instance of a baseline profile data set for the panel, the 

calibrated profile data set providing a measure of the biological condition of the subject; 
accessing data in a condition database, the condition database having a plurality of records 
relating to a population of subjects, each record corresponding to a distinct instance of the 
calibrated profile data set; and evaluating the first instance of the calibrated profile data set in 

20 relation to data in the condition database. 

In another embodiment of the invention, a method is provided of displaying 
quantitative gene expression analysis data associated with measurement of a biological 
condition, the method including: identifying a first profile data set pertinent to the gene 
expression analysis data, the first profile data set including a plurality of members, each 

25 member being a quantitative measure of the amount of a distinct RNA or protein constituent 
in a panel of constituents selected so that measurement of the constituents enables 
measurement of the biological condition; producing a calibrated profile data set for the panel, 
wherein each member of the calibrated profile data set is a function of a corresponding 
member of the first profile data set and a corresponding member of a baseline profile data set 

30 for the panel, the calibrated profile data set providing a measure of the biological condition 
of the subject; and displaying the calibrated profile data set in a graphical format. 
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Another embodiment is directed to a descriptive record of a change in a biological 
condition, that includes: a first set of numerical gene expression values for a panel of gene 
loci, each value in the set corresponding to a single gene locus in a panel of gene loci, the set 
of values forming a profile data set for a population of cells subjected to a first biological 
5 condition; a second set of numerical gene expression values for the panel of gene loci, each 
value in the set corresponding to a single gene locus, the set of values forming a baseline 
profile data set for a second population of cells subjected to a second biological condition, 
the second set of values optionally being an average for multiple gene expression values 
from multiple populations of cells for each locus in the panel; and a third set of numbers 

10 corresponding to the ratio of the first set of values and the second set of values with respect 
to each gene locus in the panel, the third set being a calibrated profile data set; the profile 
data set and the calibrated profile data set being descriptive of the first biological condition 
with respect to the second biological condition. 

In another embodiment, a method for diagnosing a biological condition of a subject is 

15 provided that includes: obtaining a sample from a subject; subjecting a population of cells to 
the sample and determining the presence of a first biological condition with respect to a 
second biological condition according to any of the above claims. 

In another embodiment, a method is provided for diagnosing a susceptibility for a 
biological condition in a subject, that includes obtaining a sample from the subject; creating a 

20 descriptive record, according to the above, wherein the baseline set of values is an average of 
second values contained in a library of descriptive records for the second biological 
condition; the library containing a plurality of descriptive records grouped according to a 
predetermined biological condition; comparing the calibrated profile data set of the subject 
with the library of calibrated profile data sets and diagnosing the susceptibility of the subject. 

25 In another embodiment, a method is provided for monitoring the progress of a 

biological condition, including: creating a plurality of descriptive records, according to the 
above; wherein each set of first values is determined at preselected time intervals with 
respect to the first record; comparing each calibrated profile data set with a library of 
calibrated profile data sets, the plurality of calibrated profile data sets being grouped 

30 according to a predetermined biological condition; and determining the progress of the 
biological condition with respect to gene expression. 
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In another embodiment, a method is provided for establishing the biological activity 
of a composition, including: selecting a population of cells; subjecting the cells to the 
composition; and determining the record according to the above description using a 
standardized baseline profile data set for the biological condition. 
5 In another embodiment, a method is provided for determining which therapeutic 

agent from a choice of a plurality of therapeutic agents to administer to a subject so as to 
change a biological condition in a subject from a first biological condition to a second 
biological condition; including: subjecting a sample from the subject to each of a plurality of 
therapeutic agents; determining a descriptive record for each of the samples according to any 

10 of the above described methods, comparing each of the calibrated profile data sets to a 
library of calibrated profile data sets, the library of calibrated data sets being grouped 
according to a predetermined biological condition; and determining which of the therapeutic 
agents is capable of changing the first biological condition in the subject to the second 
biological condition in the subject. 

15 In another embodiment, a method is provided for characterizing the biological 

effectiveness of a single batch of a composition produced by a manufacturing process, 
comprising: providing a fingerprint or signature profile according to any of the above 
methods; and labeling the batch of the composition by placing the fingerprint (signature 
profile) on each container in the batch. 

20 In another embodiment, a method is provided for accessing biological information on 

a digital storage medium as described above, including: making the information available to 
a user. 

In another embodiment, a method is provided for consumer evaluation of a product , 
wherein the consumer evaluation is dependent on a signature profile, including: identifying 
25 the product using the signature profile. 

In another embodiment, a computer program product is provided for evaluating a 
biological condition of a subject or for evaluating a biological condition resulting from the 
use of an agent, including a computer usable medium having computer readable program 
code thereon, the computer program code; including: a program code for classifying a 
30 sample from the subject or the agent for an identifiable record; a program code for deriving a 
first data set, the first profile data set including a plurality of members, each member being a 
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quantitative measure of the amount of a distinct RNA or protein constituent in a panel of 
constituents selected so that measurement of the constituents enables measurement of the 
biological condition; the profile data set being stored in the record; and a program code for 
optionally producing a calibrated profile data set for the panel, for storage in the record, each 
5 member of the calibrated profile data set being a function of a corresponding member of the 
first profile data set and a corresponding member of a baseline profile data set for the panel, 
the calibrated profile data set providing a measure of the biological condition of the subject. 

In another embodiment of the invention, a computer system for evaluating a 
biological condition of a subject or for evaluating a biological condition resulting from the 

10 use of an agent is provided, the computer system, including: a classification module for 
classifying a sample from the subject or the agent in an identifiable record; a derivative 
module for deriving a first data set, the first profile data set including a plurality of members, 
each member being a quantitative measure of the amount of a distinct RNA or protein 
constituent in a panel of constituents selected so that measurement of the constituents enables 

15 measurement of the biological condition; and a production module for producing a calibrated 
profile data set for the panel, wherein each member of the calibrated profile data set is a 
function of a corresponding member of the first profile data set and a corresponding member 
of a baseline profile data set for the panel, the calibrated profile data set providing a measure 
of the biological condition of the subject. 

20 In another embodiment, a method is provided for analyzing a patient for a biological 

condition at a remote site, including: providing a kit for measuring a profile data base for 
evaluating a biological condition, the kit including reagents for quantitative analysis of RNA 
or protein for a panel of gene loci; accessing a centralized database containing baseline 
profile data sets corresponding to the panel; determining the calibrated profile data set for the 

25 patient; and analyzing the biological condition of the patient. 

Further embodiments of the invention include the use of calibrated profile data bases 
for determining the biological condition at one site in a subject from a sample taken from a 
second remote site. The biological condition may include disease, therapeutic interventions, 
aging, health conditioning and exercise, exposure to toxins, status of infection and health 

30 status. For example, calibrated precision profiles may be used to measure a biological 

condition(s) in one site (for example, the liver) by sampling cells from the same subject, but 
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at a different site not generally considered a target for the biological condition, for example, 
peripheral blood cells in the case of liver disease. 

Further embodiments of the invention include the use of calibrated profile data bases 
for determining the biological condition of the subject that includes placing a cell or fluid 
5 sample on indicator cells to assess the biological condition, the biological condition 
including disease, therapeutic interventions, aging, health conditioning and exercise, 
exposure to toxins, status of infection and health status. 

Further embodiments of the invention include the use of calibrated profile data bases 
and profiles to assess, compare and contrast the bioactivities of therapeutic agents and 
10 therapeutic agent candidates including comparison of two agents having unknown properties 
; comparison of agents that are complex mixtures against those that are simple mixtures and 
comparisons of a single agent against a class of agents. 

Further embodiments of the invention include the use of calibrated profile databases 
derived from in vitro dosing of an agent in indicator cells, or fluids or cells ex vivo to predict 
15 in vivo activities, activities including efficacy and toxicity and further permitting data on 
short term in vivo dosing of agent to predict long-term activities as described herein. 

Another embodiment of the invention is at least one databases and its uses, the 
databases containing at least one of calibrated profile data sets and baseline profile data sets 
for discrete populations identified according to factors including diseases, geography, 
20 ethnicity, age and state of health. 

A further embodiment of the invention is a database corresponding to an individual 
over time, the uses including managing a personalized health care program. 

Additional embodiments include methods of running a clinical trial using calibrated 
profile data and databases containing calibrated profile data from in vitro and in vivo studies 
25 of the effect of the agent on populations of cells and methods of building a clinical research 
network that uses calibrated profile data and traditional medical data. 

Another embodiment of the invention provides a method, for evaluating a biological 
condition of a subject. This method includes: 

a. obtaining from the subject a sample having at least one of RNAs and 

30 proteins; 

b. deriving from the sample a first profile data set, the first profile dataset 
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including a plurality of members, each member being a quantitative measure of the amount 
of a distinct RNA or protein constituent in a panel of constituents selected so that 
measurement of the constituents enables measurement of the biological condition; and 
c. producing a calibrated profile data set for the panel, wherein each 
5 member of the calibrated profile data set is a function of a corresponding member of the first 
profile data set and a corresponding member of a baseline profile data set for the panel, the 
calibrated profile data set providing a measure of the biological condition of the subject. 

In this embodiment, the biological condition relates to inflammation and the panel 
includes at least half, and, optionally, at least eighty percent of the constituents of the 

10 Inflammation Selected Panel of Table 1. In a related embodiment, the biological condition 
relates to cell growth and differentiation and the panel includes at least half, and optionally 
at least eighty percent, of the constituents of the Cell Growth and Differentiation Selected 
Panel of Table 2. In other related embodiments, the biological condition relates to 
metabolism and toxicity and the panel includes at least half, and optionally at least eighty 

15 percent, of the constituents of the Liver Metabolism and Toxicity Selected Panel of Tables 3 
or 7. In another related embodiment, the biological condition relates to skin response and the 
panel includes at least half, and optionally at least eighty percent, of the constituents of the 
Skin Response Selected Panel of Table 4. In another related embodiment, the biological 
condition relates to the vascular system and the panel includes at least half, and optionally, at 

20 least eighty percent, of the constituents of the Vascular Selected Panel of Table 6. In a further 
related embodiment, the biological condition relates to the prostate health and disease and the 
panel includes at least half, and optionally at least eighty percent of the constituents of the 
Prostate Selected Panel of Table 5. 

Another embodiment of the invention provides a method, for evaluating a biological 

25 condition of a subject, that includes: obtaining from the subject a sample having at least one 
of RNAs and proteins; deriving from the sample a profile data set, the profile data set 
including a plurality of members, each member being a quantitative measure of the amount 
of a distinct RNA or protein constituent in a panel of constituents selected so that 
measurement of the constituents enables measurement of the biological condition; wherein 

30 such measurement is performed for each constituent under conditions wherein efficiencies 
of amplification for all constituents are substantially similar, the profile data set providing a 
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measure of the biological condition of the subject. 

Another embodiment of the invention provides a method, for evaluating a biological 
condition of a subject, that includes: obtaining from the subject a first sample having at least 
one of fluid, cells and active agents; applying the first sample or a portion thereof to a 
5 defined population of indicator cells; obtaining from the indicator cells a second sample 
containing at least one of RNAs or proteins; deriving from the second sample a profile data 
set, the profile data set including a plurality of members, each member being a quantitative 
measure of the amount of a distinct RNA or protein constituent in a panel of constituents 
selected so that measurement of the constituents enables measurement of the biological 

10 condition; wherein such measure is performed for each constituent under conditions wherein 
efficiencies of amplification for all constituents are substantially similar, the profile data set 
providing a measure of the biological condition of the subject. 

Another embodiment of the invention provides method for evaluating a biological 
condition affected by an agent, the method that includes obtaining, from a target population 

15 of cells to which the agent has been administered, a sample having at least one of RNAs and 
proteins; deriving from the sample a profile data set, the profile data set including a plurality 
of members, each member being a quantitative measure of the amount of a distinct RNA or 
protein constituent in a panel of constituents selected so that measurement of the constituents 
enables measurement of the biological condition; wherein such measure is performed for 

20 each constituent under conditions wherein efficiencies of amplification for all constituents 
are substantially similar, the profile data set providing a measure of the biological condition 
as affected by the agent. 

Efficiencies of amplification of all constituents may differ by less than approximately 
2%. The efficiencies of amplification may differ by less than approximately 1%. Moreover, 

25 in any of the embodiments of the invention described above which refers to a panel, the 
panel may include at least four constituents selected 

from any one of Tables 1 through 7. For example, at least four constituents may be selected 
from the group consisting of expression products of TNF-ot, IL-l-a, IL-p, IFN-y, IL-8, and 
IL-10. 

30 In another embodiment of the invention, a kit is provided having primer-probe 

combinations for measuring expression products of at least four constituents selected from 
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any one of Tables 1 through 7. The kit may further include a primer probe combination 
constructed so as to hybridize only to at least one of cDNA and mRNA at a biologically 
relevant locus. Moreover, in each combination, a reverse primer may be selected which is 
complementary to a coding DNA strand located across an intron-exon junction, with not 
5 more than three bases of a three-prime end of the reverse primer being complementary to a 
proximal exon. 

Brief Description of the Drawings 
The foregoing features of the invention will be more readily understood by reference 
to the following detailed description, taken with reference to the accompanying drawings, in 
10 which: 

Figure 1 is a diagram showing the flow of information from data acquired in 
molecular pharmacology and toxicology, clinical testing, and use of the data for the 
application to individualized medicine. 

Figure 2 is a diagram showing the drug discovery pathway of new compounds from 
15 early leads to likely drug candidates. Although calibrated profile data sets are indicated at 
the pre-clinical step, gene expression data can be acquired and is useful at any of the stages 
shown. END refers to investigative new drug and refers to an early stage in regulatory 
review. 

Figure 3 is a diagram presenting a comparison of in vivo and in vitro protocols for 
20 forming calibrated profile data sets for rapidly assessing product candidate toxicity and 
efficacy in accordance with several embodiments of the present invention. 

Figure 4 is a diagram showing the application of gene expression profiling as a guide 
to pre-clinical and clinical studies in accordance with an embodiment of the present 
invention. 

25 Figure 5 is a diagram showing a method in accordance with an embodiment of the 

present invention for obtaining profile data in the absence of a stimulus and in the presence 
of a stimulus. 

Figure 6 is a diagram showing the creation of a library of profile data associated with 
a plurality of subjects in accordance with an embodiment of the present invention. 
30 Figure 7 is a diagram illustrating the structure of a profile data record in accordance 

with an embodiment of the present invention. 
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Figure 8 is a diagram illustrating a data entry screen for a data record of the type 
shown in Figure 7 and typical contexts in which data records may be compiled in accordance 
with embodiments of the present invention. 

Figure 9 shows an embodiment of the present invention in which profile data, in 
5 either the raw or calibrated form, is evaluated using data from a database that is remotely 
accessed over a network. 

Figure 10 shows a schematic of a phase two clinical trial that utilizes gene expression 
profiling (a). The right hand panel (b) indicates that the same information may be used in 
Phase IV or post marketing studies to compare the efficacy of already approved and 

10 marketed drugs or to guide the marketing of such therapies; to guide the choice of therapy for 
an individual subject or population from within a class of appropriate compounds. 

Figure 1 1 is a bar graph that shows a graphical representation in the form of a 
histogram representing calibrated profile data sets based on quantitative expression of RNA 
in cells of a whole blood sample using a panel of 12 constituents where each constituent 

15 corresponds to a unique gene locus, (a) The blood sample is stimulated ex vivo with heat 
killed staphylococci are further exposed H7-TPCK, H9-UT-77, or H16-Dex as indicated. 
The baseline profile data set is a blood sample stimulated ex vivo (in vitro) with heat killed 
staphylococci (b) The blood sample is stimulated ex vivo with lipopolysaccharide (LPS) and 
is then further exposed to compounds H7-TPCK, H9-UT-77, or H16-Dex as indicated. 

20 Figure 12 is a bar graph with a logarithmic axis that shows a graphical representation 

of calibrated profile data sets for whole blood stimulated ex vivo with lipopolysaccharide 
(LPS), using a panel of 9 constituents, each constituent corresponding to a gene locus 
encoding the gene products indicated, the blood being further exposed to anti-inflammatory 
agents: methotrexate, meclofenamate and methylprednisolone. The baseline profile data set 

25 is derived from LPS stimulated (but otherwise untreated) cells. 

Figure 13 are bar graphs with a logarithmic axis that shows a graphical representation 
of calibrated profile data sets for two different samples of whole blood (a) 991 1 16 and (b) 
991028 reflecting the biological condition of the cells using a panel of 24 members, each 
member corresponding to a gene locus, the baseline profile data set being derived from 

30 untreated cells. The calibrated data sets for cells exposed for six hours to three inflammation 
inducing agents (lipopolysaccharide, heat killed staphylococci, and phytohemagglutinin) are 
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compared for each sample, (c) shows a direct comparison of LPS stimulated 991 1 16 with 
respect to 991028 as the baseline profile data set (d) shows a direct comparison between 
unstipulated 991 1 16 and 991028. 

Figure 14 is a bar graph with a logarithmic axis that shows a graphical representation 
5 of calibrated profile data sets using a panel of 22 constituents, each constituent corresponding 
to a gene locus, the baseline profile data set being derived from untreated cells. Whole blood 
is exposed for six hours ex vivo to three inflammation inducing agents (lipopolysaccharide, 
heat killed staphylococci, and phytohemagglutinin) which are then treated with a single anti- 
inflammatory agent (methyl prednisolone) to reveal similarities and differences in the effect 

10 of a single agent on cell populations differing in their biological condition. 

Figure 15 is a bar graph with a logarithmic axis that shows a graphical representation 
of calibrated profile data sets for whole blood where one calibrated data set refers to a subject 
(subject 2) who has been treated in vivo with a corticosteroid (dexamethasone), a second data 
set refers to the treatment of a blood sample from the same subject prior to in vivo treatment 

15 where that sample has been treated ex vivo (in vitro) and the third data set refers to a second 
subject treated in vivo with dexamethasone (subject 1). The data sets demonstrate the 
reproducibility and predictability of an ex vivo (in vitro) treatment of blood compared to in 
vivo treatment with the same agent. The figure also shows minor variation between samples 
from different subjects reflecting interpersonal variability. A panel of 14 constituents is 

20 provided. The baseline profile data set is derived from untreated whole blood from the 
cognate subject. 

Figure 16 is a bar graph with a logarithmic y axis that shows a graphical 
representation of calibrated profile data sets for whole blood where one calibrated data set 
refers to (a) 2 subjects who have been treated in vivo with an inactive placebo for 3 days and 

25 (b) active prednisolone for 3 days at 100 mg/day. The data set shows some variation between 
samples from different subjects treated with the same drug. The data sets demonstrate 
similarity of responses across the same gene loci, as well as, quantitative variation at other 
loci suggesting quantifiable interpersonal variation. A panel of eight members is provided. 
The baseline profile data set is derived from untreated whole blood. 

30 Figure 17 is a bar graph with logarithmic y axis that shows a graphical representation 

of calibrated selected profile data sets for two samples taken from a single subject within a 
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19 day period using a panel (e.g., inflammation panel) of 24 members where each member 
corresponds to a unique gene locus. The baseline profile data set relates to peripheral blood 
taken from the subject prior to treatment. 

Figures 18(a) through 18(e) are bar graphs with a logarithmic axis that show a 
5 graphical representation of calibrated profile data sets for each of 5 subjects from which a 
blood sample has been taken. Each of the blood samples was exposed to the inflammatory 
agent phytohemagglutinin (PHA) or to a therapeutic agent (anti-inflammatory agent) at 
different concentrations: O.IjjM, 0.3uM, l^iM, 3[iM and 5|llM, for a 4 hour period ex vivo (in 
vitro) so as to determine the optimum dose for treating the subject. A panel of 6 constituents 

10 was used corresponding to 6 gene loci. The baseline profile data set was an untreated sample 
obtained from the cognate donor. 

Figure 19 is a bar graph with a logarithmic axis that shows a graphical representation 
of calibrated profile data sets for three different subjects having different biological 
conditions using a panel with 24 constituents. The profile data sets show variability 

15 according to these conditions providing the basis for a diagnostic signature panel: (a) shows 
a calibrated profile data set for a smoker against a baseline for a non-smoker, (b) shows a 
calibrated profile data set for a subject with chronic obstructive pulmonary disease against a 
baseline for a subject lacking this disease. The baseline profile data set is derived from a 
subject that is "normal" with respect to these conditions. 

20 Figure 20 illustrates that an individual responses can be distinguished from a 

similarly treated population. A comparison of the response of a single animal compared to 
its experimental cohort (n=5 animals) with respect to a single locus (GST-P) is provided. 
The baseline data set is the cohort average. The figures shows that this animal varied 
significantly from the daily, population average in the first two days of the study, but became 

25 more similar to the cohort average with time after treatment with acetaminophen. 

Figure 21 is a bar graph with a logarithmic axis that shows a graphical representation 
of calibrated profile data sets for samples of blood treated ex vivo with LPS or LPS and one 
of three anti-inflammatory herbals (Echinacea, Arnica or Siberian Ginseng) at a 
concentration of 200 ug/ml. A panel of 24 constituents is used. The baseline profile data set 

30 is derived from LPS stimulated cells absent a herbal treatment. The figure illustrates the 
effectiveness of the use of the calibrated selected profile to investigate the overall effects of 
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complex compounds such as nutraceuticals whose biological effect is a summation of more 
than one activity. In this case, each of the herbals is consumed as an immunostimulant, 
however the calibrated selected profiles reveal a unique pattern shows a mixture of both 
immunostimulatory and anti-inflammatory effects. 
5 Figure 22 is a bar graph with a logarithmic axis that shows a graphical representation 

of calibrated profile data sets for samples of blood treated ex vivo with LPS or LPS and 
methylprednisolone or LPS and Arnica. The baseline profile data set is LPS treated blood 
sample. 

Figure 23 is a bar graph with a logarithmic axis that shows a graphical representation 
10 of calibrated profile data sets for samples of THP-1 cells treated with LPS or LPS and Arnica 

at three different concentrations using a panel of 22 constituents. The baseline profile data 

set is untreated THP-1 cells. The figure illustrates a concentration response with respect to 

the gene expression across the calibrated profile. 

Figure 24 is a bar graph with a logarithmic axis that shows a graphical representation 
15 of calibrated profile data sets for samples of THP-1 cells treated ex vivo with four different 

commercial brands of Echinacea using a panel of 8 constituents. The baseline profile data 

set is untreated THP-1 cells. 

Figure 25 illustrates the use of the calibrated profile to compare relative efficacy 

across brands, or different formulations. Calibrated profile data sets for herbal preparations 
20 from different manufacturing sources with respect to an indicator monocytic cell line (THP- 

1) are shown graphically, the baseline profile data set being THP-1 cells absent the herbal. 

(a) Three commercial herbal Echinacea preparations at 250 (|ig/ml); (b) three herbal 

preparations at different concentrations (250|ig/ml, 50|Lig/ml and 3-10|ig/ml) (c) four 

commercial Echinacea brands at 250 (ig/ml). 
25 Figures 26(a) through 26(d) illustrate calibrated profile data sets, using a subset of the 

Inflammation Selected Panel, that show the effect of administration of a steroid. 

Figures 27(a) through 27(d) illustrate calibrated profile data sets, using a subset of the 

Inflammation Selected Panel, providing a comparison of the effects of administration of 

methylprednisolone and Ibuprofen. 
30 Figures 28(a) through 28(d) illustrate calibrated profile data sets, using a subset of the 

Inflammation Selected Panel, in identifying chronic obstructive pulmonary disease (COPD) 
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patients. 

Figures 29(a) and 29(b) provide illustrations in which evaluations of the effects of 
drug exposure performed in vitro correspond closely with evaluations performed in vivo, 
employing in each case calibrated profile data sets, using a subset of the Inflammation 
5 Selected Panel. 

Figure 30 illustrates the effect of different agents evaluated using a subset of the 
Selected Prostate Panel, and shows broad functions of constituents of the panel. 

Figure 31 illustrates the effect of the pharmaceutical agent ,clofibrate, as measured on 
a rat liver metabolism selected panel. The profiles for six rats are provided as indicated on 

10 the z axis. The control (baseline) is a set of rats treated only with the carrier control. 

Figure 32 illustrates the ability of the rat metabolism selected panel to differentiate 
drug responses (clofibrate versus benzo[a]pyrene) in Spraque-Dawley rats. Clofibrate (right 
hand bars) and Benzopyrene (left hand bars). The control (baseline) is a set of rats treated 
only with the carrier control. Figure 33 illustrates the effect of administration of a stimulant 

15 (TNF-a) as measured by a combination of constituents selected from the inflammation, 
skin/epithelial, and vascular selected panels. The target is human keratinocytes in culture. 
The baseline is non stimulated cells. The baseline is a set of rats that were non-stimulated. 
Figure 34 illustrates the effect of administration of benzo[a]pyrene on cryo-preserved human 
hepatocytes over time as measured by the human liver selected panel. The control (baseline) 

20 are cells treated similarly but without the addition of benzo[a]pyrene. Figure 35 demonstrates 
the effect of treating human umbilical vein endothelial cells in culture with TNFa. for 24 
hours. The control or baseline is established from cells handled similarly but without the 
addition of the stimulant. 

Figure 36 illustrates the protective effect of the antioxidant n-acetylcysteine (NAC) 

25 on human keratinocytes in culture after exposure the UVB energy. The dark bars indicate the 
effect of UVB exposure only. Cells that were treated with NAC followed by exposure to the 
same UVB energy show a decreased induction of expression at most of the gene loci covered 
by the skin selected panel. The baseline corresponds to cells exposed to assay media only. 
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Detailed Description of Specific Embodiments 
As used in this description and the accompanying claims, the following terms shall 

17 



2331/130 

have the meanings indicated, unless the context otherwise requires: 

A "collection of cells" is a set of cells, wherein the set has at least one constituent. 
A "population of cells" includes one or more cells. A population of cells may refer to 

cells in vivo or to in vitro cultures. In vitro cultures may include organ cultures or cell 
5 cultures where cell cultures may be primary or continuous cell cultures of eukaryotic or 

prokaryotic cells. Cell lines can be primary cultures or cell samples, e.g. from a tumor, from 

blood or a blood fraction, or biopsy explants from an organ, or can be established cell lines 

or microbial strains. 

A "region of the subject" from which proteins are obtained may (but is not required 

10 to be) the same part of the subject from which has been obtained a collection of cells or a 
population of cells. The cells and the proteins may both be obtained from blood of the 
subject, for example. Alternatively, for example, the cells may be obtained from blood and 
the proteins may be obtained from a scraping of tissue or vice versa. Similarly, the proteins 
may be obtained from urine of the subject, for example, whereas the cells may be obtained 

15 elsewhere, as, for example, from blood. 

A "panel" of genes is a set of genes including at least two constituents. 
A "normative" condition of a subject to whom a composition is to be administered 
means the condition of a subject before administration, even if the subject happens to be 
suffering from a disease. 

20 An "expression" of a gene includes the gene product whether RNA or protein 

resulting from translation of the messenger RNA. 

A "large number" of data sets based on a common panel of genes is a number of data 
sets sufficiently large to permit a statistically significant conclusion to be drawn with respect 
to an instance of a data set based on the same panel. 

25 A "biological condition" of a subject is the condition of the subject in a pertinent 

realm that is under observation, and such realm may include any aspect of the subject 
capable of being monitored for change in condition, such as health, disease including cancer; 
trauma; aging; infection; tissue degeneration; developmental steps; physical fitness; obesity, 
or mood. As can be seen, the conditions may be chronic or acute or simply transient. 

30 Moreover, a targeted biological condition may be manifest throughout the organism or 

population of cells or may be restricted to a specific organ (such as skin, heart, eye or blood). 
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The term "biological condition" includes a "physiological condition". 

The "blind administration" of a selected one of a composition or placebo to a subject 
in a clinical trial involves administering the composition or placebo to the subject in 
accordance with a protocol pursuant to which the subject lacks knowledge whether the 
5 substance administered is the composition or a placebo. 

An "organism" is any living cell including microorganisms, animals and plants. An 
animal is commonly in this context a mammal, but may be a vertebrate non-mammal, as e.g., 
a zebra fish, or an invertebrate, as, e.g. Caenorhabditis elegans. 

An "agent" is a composition or a stimulus. A "stimulus" may include, for example 
10 ultraviolet A or B, or light therapy for seasonal affective disorder, or treatment of psoriasis 
with psoralen or treatment of melanoma with embedded radioactive seeds, other radiation 
exposure, etc. 

A "composition" includes a chemical compound, a nutraceutical, a pharmaceutical, a 
homeopathic formulation, an allopathic formulation, a naturopathic formulation, a 
15 combination of compounds, a toxin, a food, a food supplement, a mineral, and a complex 
mixture of substances, in any physical state or in a combination of physical states. 

A "clinical indicator" is any physiological datum used alone or in conjunction with 
other data in evaluating the physiological condition of a collection of cells or of an organism. 
This term includes pre-clinical indicators. 
20 A "selected panel" is an experimentally verified set of constituents, each constituent 

being a distinct expressed product of a gene, whether RNA or protein, wherein constituents 
of the set are selected so that their measurement provides a measurement of a targeted 
biological condition A "selected profile" is a set of values associated with constituents of a 
selected panel resulting from evaluation of a biological sample (or population of samples). 
25 A "signature profile" is an experimentally verified subset of a selected profile 

selected to discriminate a biological condition, agent or physiological mechanism of action. 
A "signature panel" is a subset of a selected panel, the constituents of which are selected to 
permit discrimination of a biological condition, agent or physiological mechanism of action. 
"Distinct RNA or protein constituent" in a panel of constituents is a distinct 
30 expressed product of a gene, whether RNA or protein. 

"Precision" as used herein, means the repeatability and reproducibility of an assay for 
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gene expression in terms of variance or standard deviation where repeatability is the measure 
of variance of a single sample measured many times and reproducibility is the variance 
accounting for assaying similar biological conditions (i.e., similar samples) using multiple 
instruments, reagent lots, and multiple operators over time. Repeatability is sometimes 
5 referred to as intra-assay variation and reproducibility may be referred to as inter-assay 
variability. Reproducibility describes the variance around similar samples obtained from a 
population of individuals with a similar biological condition including normal, good health 
or clinically definable disease states. 

"Substantially reproducible" as used herein means that the variance, as measured by 
10 coefficient of variation or standard deviation, for repeatability and reproducibility is less than 
about 3%. 

"Reproducible" as used herein means that each member of a profile data set is 
reproducible within a range for repeatability and reproducibility of within a range of 20%, 
and more particularly within a range of 10%. 

15 An embodiment of the invention includes the formation of profile data sets and 

calibrated data sets that describe a biological condition or an effect of an agent on a 
biological condition. A profile data set represents a set of values that correspond to 
measurements of gene expression wherein the measurement values themselves are 
informative. A calibrated data set represents a set of values that correspond to variations in 

20 gene expression where the variations are informative. This approach does not require 
comprehensive analysis of all gene expression in target cells associated with a particular 
condition. Nor is any one single gene locus necessarily of particular significance. Rather a 
pattern of expression or variation (a profile) is sought that correlates, in a reproducible 
manner, with a particular condition. There may be no a priori knowledge of a correlation but 

25 rather a correlation may be established by evaluating a panel of constituents of reasonable 
size (for example up to 100 constituents) and iteratively testing the gene expression profiles 
for different subjects or for the same subject from which the most informative loci for a 
particular condition may be selected. An informative subgroup of constituents in a panel 
may be selected that consistently vary for a particular condition and this subgroup may then 

30 become the signature panel, the signature panel giving rise to a signature profile. 

In another embodiment of the invention, the profile data sets by themselves describe a 
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biological condition or an effect of an agent on a biological condition. A profile data set 
represents a set of gene expression values that correspond to particular gene constituents in a 
panel of constituents, wherein the panel may be two or more constituents selected to include 
gene loci that directly or indirectly vary with a particular biological condition, and wherein 
5 the gene expression values themselves are informative. This approach does not require 
comprehensive analysis of all gene expression in target cells associated with a particular 
condition. Nor is any one single gene locus necessarily of particular significance. Rather a 
pattern of expression (a profile) is sought that correlates, in a reproducible manner, with a 
particular condition. There may be no a priori knowledge of a correlation but rather a 

10 correlation may be established by evaluating a panel of constituents of reasonable size (for 
example up to 100 constituents) and iteratively testing the gene expression profiles for 
different subjects or for the same subject from which the most informative loci for a 
particular condition may be selected. As described above, an informative subgroup of 
constituents in a panel may be selected that consistently vary for a particular condition and 

15 this subgroup may then become the signature panel, the signature panel giving rise to a 
signature profile. 

In further embodiments of the invention, any calibrated data set for an individual that 
has more members than reflective of a single signature panel may be mined for calibrated 
profiles that correspond to additional signature panels, thereby potentially providing new 

20 insights into mechanisms of action of a biological condition on sets of genes. Measurement 
of changes in transcribed RNA in a cell as a result of an environmental change or aging is an 
exquisitely sensitive measure of the response of a cell. Techniques available today to 
quantify transcribed RNA in a cell add to the sensitivity of the approach. Embodiments of 
the invention that are directed to patterns of change in amounts of transcribed RNA provide a 

25 means to focus and interpret this rich information. 

In contrast to the above approach, much attention in the prior art has been directed to 
the sequencing of the human genome and the identification of all the genes encoded therein. 
Accompanying the growing amount of sequence data, microarrays provide a means to survey 
many hundreds to thousands of gene sequences. Microarrays are being used to provide DNA 

30 profiles that identify mutations in an individual and those mutations will be associated with 
predictions concerning development of disease in those individuals. 
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Transcriptomics and proteomics are now the focus of increasing attention. These 
studies are directed to analyzing the entire body of RNA and protein produced by living 
cells. Microarrays provide a method for analyzing many thousands of different human 
RNAs as to whether they are expressed and by which cells. For example, a project 
5 undertaken by the National Cancer Institute and others to examine mRNAs produced by 
various types of cancer cells, have revealed 50,000 genes that are active in one or more 
cancers. The goal of these studies is to identify novel cancer drugs that are directed to 
knocking out or enhancing the production of certain proteins. (Kathryn Brown, The Human 
Genome Business Today, Scientific American, July 2000, p.50; Julia Karow, The "Other" 

10 Genomes, Scientific American, July 2000, p.53; Ken Howard, "The Bioinformatics Gold 
Rush, Scientific American, July 2000, p.58; Carol Ezzell, Beyond the Human Genome, 
Scientific American, July 2000, p.64; all incorporated by reference.) Major efforts in 
correlating genetic variation of individuals and the functional interrelationships of genes in 
health and disease are being conducted in a variety of consortia including the single 

15 nucleotide polymorphism consortium and the Human Epigenome Consortium (Beck et al. 
Nature BioTechnology 17 (1999) p 1 144). The Epigenome Consortium plans to analyze sets 
of genome fragments from both healthy and diseased individuals in the 500 different human 
tissues (Bioworld International: December 22, 1999). This approach seeks to correlate 
absolute expression of genes associated with a particular condition with the presence of that 

20 condition. Examples of prior art that seek to measure gene expression in absolute amounts 
including by subtractive methods or by determining amounts with respect to housekeeping 
genes or by targeting a single gene expression system are U.S. 5,643,765; U.S. 5,81 1,231; 
U.S. 5,846,720; U.S. 5,866,330; U.S. 5,968,784; U.S. 5,994,076; WO 97/41261; WO 
98/24935; WO 99/1 1822; WO 99/44063; WO 99/46403; WO 99/57130; WO00/22172 and 

25 WO00/11208. 

We have taken a different and novel approach to the above by identifying 
reproducible patterns of gene expression that are informative by virtue of the degree of 
variation between a sample and a baseline, for example, in a subject with the condition and a 
subject without the condition. The variations may be correlated with other non-genetic 

30 indications such as clinical indicators (for humans) of a traditional nature but are not required 
per se to be causative. Accordingly, the amount of gene expression product (for example 
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RNA transcript) produced by a gene locus in a cell under certain circumstances is measured 
and then stored as a value in a first profile data set. This value is calibrated with respect to a 
second value (a baseline profile data set) to provide a member of a calibrated profile data set. 
The values recorded for the profile data set, relying on a particular baseline data set to 
5 produce a calibrated data set, become part of the descriptive record; any or all of these results 
can be stored in a database which may be accessed through a global network. In this way any 
new data in the form of a profile data set or a calibrated profile data set measured at any 
global location can be directly compared to an archive of descriptive records including 
calibrated profile data sets and baseline data sets so as to extend the stored library of profiles 
10 and provide predictive, diagnostic, or evaluative data about a particular biological condition 
or agent. 

As we have exemplified below, selected profiles are in fact of themselves 
informative of and descriptive of biological condition, when they are obtained with precision 
as defined herein. The reason for this is that we have found that gene expression, properly 

15 measured- with requisite precision-is informative of the biological condition of the subject 
with respect to which gene expression has been measured. Further discussion of these points 
also occurs in our co-pending application, U.S. Application Serial No. 10/742,458, filed 
December 19, 2003, which is hereby incorporated by reference. That application, points out, 
for example, that algorithms can be fashioned that use selected profile data to provide single- 

20 valued indices that are informative, for example, of health or disease of a subject. But as 
shown there and herein, these matters can also be seen directly in the selected profile data. 

We have exemplified the use of selected panels of constituents corresponding to gene 
loci from which quantitative gene expression is determined by, for example, quantitatively 
measuring the transcribed RNA in a sample of a subject, for applications that include: (a) 

25 measurement of therapeutic efficacy of natural or synthetic compositions or stimuli that may 
be formulated individually or in combinations or mixtures for a range of targeted 
physiological conditions; (b) predictions of toxicological effects and dose effectiveness of a 
composition or mixture of compositions for an individual or in a population; (c) determining 
how two or more different agents administered in a single treatment might interact so as to 

30 detect any of synergistic, additive, negative, neutral or toxic activity (d) performing pre- 
clinical and clinical trials by providing new criteria for pre-selecting subjects according to 
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informative profile data sets for revealing disease status and conducting preliminary dosage 
studies for these patients prior to conducting phase 1 or 2 trials. Gene expression profiling 
may be used to reduce the cost of phase 3 clinical trials and may be used beyond phase 3 
trials; (e) labeling for approved drugs; (f) selection of suitable medication in a class of 
5 medications for a particular patient that is directed to their unique physiology; (g) diagnosing 
or determining a prognosis of a medical condition or an infection which may precede onset 
of symptoms or alternatively diagnosing adverse side effects associated with administration 
of a therapeutic agent; (h) managing the health care of a patient; and (i) quality control for 
different batches of an agent or a mixture of agents. 
10 The subject 

The methods herein can be applied to a subject that includes any living organism 
where a living organism includes a prokaryote such as a bacterium or a eukaryote including 
single celled eukaryotic organisms at one end of the spectrum and humans at the other and 
everything in between including plants. The figures relate to calibrated profile data sets 

15 obtained from humans and mammals. Nonetheless, the methods disclosed here may be 
applied to cells of other organism without the need for undue experimentation by one of 
ordinary skill in the art because all cells transcribe RNA and it is known in the art how to 
extract RNA from all types of cells. 

A tissue sample may include a single cell or multiple cells or fragments of cells. 

20 Body fluid includes blood, urine, spinal fluid, lymph, mucosal secretions, hemolymph or any 
other body fluid known in the art for a subject. For an animal subject, a tissue or fluid 
sample may be obtained by means of a biopsy needle aspirate, a lavage sample, scrapings 
and surgical incisions or other means known in the art. 
Selected Panels 

25 Steps in selecting constituents in a selected panel may include searching publicly 

available medical literature for RNA or proteins or sets of RNAs or proteins that directly or 
indirectly vary with a particular biological condition. A selected panel containing up to 100 
constituents may be selected. According to the condition being examined, just a small subset 
of the selected panel constituents may be informative. Therefore, selected panels may have 

30 as few as two constituents, or as many as 1000 or more constituents. It is possible that even 
a single constituent may make up a selected panel. In determining membership of the 
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selected panel of genes, it is not necessary for the panel to be an exhaustive selection. Rather 
it is desired to obtain from the selected panel an expression profile that discriminates 
consistently with respect to the targeted physiological or biological condition. Moreover, a 
selected panel is not necessarily selected according to an expected profile of gene expression 
5 in cells that directly respond to a biological effect. For example, gene expression associated 
with liver metabolism may be analyzed in a blood sample. Figures 20 and 22 provide 
calibrated profiles of whole blood treated with herbal agents using markers for liver 
metabolism. 

The number of constituents in a selected panel can vary. According to the examples 

10 provided below, selected panels of up to 24-96 genes are selected for evaluating expression 
levels. Although a selected panel may be as large as 100 constituents, it is desirable for a 
particular selected panel to have no more than 24 constituents, more particularly, less than 12 
constituents. For example, subsets of no more than 8 genes have been used that may be 
derived from a larger panel but which are sufficiently informative to effectuate 

15 discrimination. The number of constituents in a selected panel for which expression is 

monitored may vary widely depending on the context. For example, Figure 1 describes data 
acquisition from in vitro cell culture and from animal toxicology studies, which includes 
expression of about 25 to 100 or more genes. In contrast, selection of markers or surrogate 
markers include, for example, three to 100 genes, preferably five to 50 or five to 25 genes to 

20 be analyzed from samples obtained in clinical studies. In this manner markers or surrogate 
markers having predictive value for a medical condition, such as a genetic predisposition, a 
response to therapeutic agent, an inflammatory condition, or an infection, etc. can be 
identified and cumulatively larger populations can be obtained to refine the correlations. A 
health profile can then be generated for an individual subject using a low volume blood 

25 sample. The blood sample can be analyzed for expression profile data of about 100 - 500 
genes, comprising markers or surrogate markers of a number of medical conditions (Fig. 1: 
right panel). Selected panels of varying sizes may be utilized as necessary and subsequent 
refinements in methodology may lead to selection of subsets having selected panels as large 
as 15 genes or 12 genes or as small as 6, 5, 4, 3 or 2 genes. 

30 We have found that precision, including the concepts of repeatability and 

reproducibility, can be achieved in measurements for constituents in profile data sets for 
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selected panels wherein coefficients of variation are less than 3 percent 

For example, we have found that we can measure concentrations of constituents in 
selected panels in a manner that is both highly precise and reproducible in samples taken 
from the same individual under the same conditions. We have similarly found that such 
5 concentration measurements are reproducible and precise in samples that are repeatedly 
tested. 

We commonly run a sample through a panel in quadruplicate; that is, a sample is 
divided into aliquots and for each aliquot we measure concentrations of each constituent in a 
selected panel. Over a total of 900 constituent assays, with each assay conducted in 

10 quadruplicate, we found an average coefficient of variation, (standard 

deviation/average)* 100, of less than 1 percent among results for each assay. This figure is a 
measure of what we call "intra-assay variability". We have also conducted assays on 
different occasions using the same sample material. With 72 assays, resulting from 
concentration measurements of constituents in a panel of 24 members, and such 

15 concentration measurements determined on three different occasions over time, we found an 
average coefficient of variation of less than 2 percent. We regard this as a measure of what 
we call "inter-assay variability". 

We have found it valuable in using the quadruplicate test results to identify and 
eliminate data points that are statistical "outliers" ; such data points are those that differ by a 

20 percentage greater, for example, than 3% of the average of all four values and that do not 
result from any systematic skew that is greater, for example, than 1%. Moreover, if more 
than one data point in a set of four is excluded by this procedure, then all data for the relevant 
constituent is discarded. 

As discussed in further detail below under "Gene Expression", we have also found it 

25 valuable to optimize the efficiency of amplification for all constituents of a panel in a 

manner to achieve comparable amplification efficiencies (that is, amplification efficiencies 
that are substantially similar as described below under "Gene Expression") for all 
constituents, so that precise quantification of gene expression of all panel constituents may 
be determined consistently on successive occasions. In this manner, there may result data 

30 that is useful because it is precise and reliable. 

What this approach means, among other things, is that by utilizing a relatively small 
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panel, and by controlling experimental parameters for the whole panel, such as sampling, 
primer selection, amplification efficiency and other parameters, we create a panel that is 
uniquely informative. This approach differs from prior art endeavors where specificity is 
optimized only on a per-constituent basis and reaction conditions are not optimized for the 
5 panel as a whole. 

It is envisaged that any single biological condition may be described by a signature 
panel having a small number of highly informative constituents providing a signature 
calibrated profile (also referred to as a fingerprint). Such a signature panel may have as few 
as two or more constituents. The presence of highly informative loci is demonstrated in 

10 several of the accompanying figures. For example, Figure 1 1(a) 11-2, 11-4 and 11-5 are highly 
informative. Highly informative constituents in Figure 21 include the pro-inflammatory - 
interleukins. The signature panel may provide a signature profile or fingerprint which is 
sufficiently robust to serve as a standard in describing a particular biological condition or an 
effect of a particular agent on a biological condition 

15 For purposes of illustrating a signature panel, constituents of a selected panel for 

measuring inflammation have been provided that are informative with respect to a particular 
biological condition. For example, we have used a selected panel for inflammation that has 6 
constituents- II- la, 11-6, 11-8, 11-18, GMCSF and IFN-y in Figure 18(a)-(e) to determine the 
response of 5 subjects to varying concentrations of drugs. This group of constituents is a 

20 subset of a larger selected panel of inflammation related gene loci such as shown in Figure 
19a and Figure 19b where the Inflammation Selected Panel includes Il-a, Il-P, 11-2, 11-3, 11-4, 
11-6, 11-7, n-8, 11-10, Il-12p40, 11-15, 11-15, 11-18, GM-CSF, Ifn-gamma, TGF-0, cox-2, ICE, 
MMP-9, ICAM, TNF-a and TNF~p. The subset of constituents were selected on the basis of 
the information sought concerning the biological condition. 

25 Embodiments of the invention provide examples of numerous different selected 

panels which may be used separately or together. These selected panels include an 
Inflammation Selected Panel (Table 1) a Cell Growth and Differentiation Selected Panel 
(Table 2), a Liver Metabolism and Toxicity Selected Panel (Table 3). We have developed 
additional selected panels including Skin Response Selected Panel (Table 4), Prostate 

30 Selected Panel (Table 5)(for measuring prostate health and disease), Vascular Selected Panel 
(Table 6)(for measuring condition of the vascular system and endothelial cells). It is a 
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significant property of each of these selected panels that measurement of the selected panel's 
constituents provides a measurement of the physiological condition to which the selected 
panel is targeted. Selected panels may also provide useful information concerning gene 
response outside the target condition. In these tables the left-hand column identifies the 
5 particular gene loci, and the right-hand column describes proteins expressed by these loci. 
However, as described in detail below, embodiments of the present invention may utilize, for 
example, mRNA or protein expression products as constituents. While below we provide 
examples based primarily on the Inflammation Selected Panel and subsets of it, the 
approaches set forth herein are equally applicable to the other selected panels described 
10 above. Although provided as examples, the above selected panels are not intended to be 
limiting. 

Gene Expression 

For measuring the amount of a particular RNA in a sample, we have used methods 
known to one of ordinary skill in the art to extract and quantify transcribed RNA from a 

15 sample with respect to a constituent of a selected panel (See detailed protocols below.) 
Briefly, RNA is extracted from a sample such as a tissue, body fluid (see Example 1 1 
below), or culture medium in which a population of a subject might be growing. For 
example, cells may be lysed and RNA eluted in a suitable solution in which to conduct a 
DNAse reaction. First strand synthesis (see Example 10 below) may then be performed 

20 using a reverse transcriptase. Gene amplification, more specifically quantitative PCR 

assays, can then be conducted and the gene of interest size calibrated against a marker such 
as 18S rRNA (Hirayama et aL, Blood 92, 1998: 46-52). Samples are measured in multiple 
duplicates, for example, 4 replicates. Relative quantitation of the mRNA is determined by 
the difference in threshhold cycles between the internal control and the gene of interest (see 

25 Example 12 below). In an embodiment of the invention, quantitative PCR is performed using 
amplification, reporting agents and instruments such as those supplied commercially by 
Applied Biosy stems (Foster City, CA). Given a defined efficiency of amplification of target 
transcripts, the point (e.g., cycle number) that signal from amplified target template is 
detectable may be directly related to the amount of specific message transcript in the 

30 measured sample. Similarly, other quantifiable signals such as fluorescence, enzyme 
activity, disintegrations per minute, absorbance, etc., when correlated to a known 
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concentration of target templates (e.g., a reference standard curve) or normalized to a 
standard with limited variability can be used to quantify the number of target templates in an 
unknown sample. 

Although not limited to amplification methods, quantitative gene expression 
5 techniques may utilize amplification of the target transcript. Alternatively or in combination 
with amplification of the target transcript, amplification of the reporter signal may also be 
used. Amplification of the target template may be accomplished by isothermic gene 
amplification strategies, or by gene amplification by thermal cycling such as PCR. 
It is desirable to obtain a definable and reproducible correlation between the 

10 amplified target or reporter and the concentration of starting templates. We have discovered 
that this objective can be achieved by careful attention to, for example, consistent primer- 
template ratios and a strict adherence to a narrow permissible level of amplification 
efficiencies (for example 99.8 to 100% relative efficiency). For example, in determining 
gene expression levels with regard to a single selected profile, it is necessary that all 

15 constituents of the panels maintain a similar and limited range of primer template ratios (for 
example, within a 10-fold range) and amplification efficiencies (within, for example, less 
than 1%) to permit accurate and precise relative measurements for each constituent. We 
regard amplification efficiencies as being "substantially similar", for the purposes of this 
description and the following claims, if they differ by no more than approximately 10%. 

20 Preferably they should differ by less than approximately 2% and more preferably by less than 
approximately 1%. These constraints should be observed over the entire range of 
concentration levels to be measured associated with the relevant biological condition. In 
practice, we run tests to assure that these conditions are satisfied. For example, we typically 
design and manufacture a number primer-probe sets, and determine experimentally which set 

25 gives the best performance. Even though primer-probe design and manufacture can be 
enhanced using computer techniques known in the art, and notwithstanding common 
practice, we still find that experimental validation is useful. Moreover, in the course of 
experimental validation, we associate with the selected primer-probe combination a set of 
features: 

30 (i) The reverse primer should be complementary to the coding DNA strand; located 

across an intron-exon junction, with not more than three bases of the three-prime end of the 
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reverse primer complementary to the proximal exon. (If more than three bases are 
complementary, then it would tend to competitively amplify genomic DNA.) 

(ii) The primer probe should amplify cDNA of less than 1 10 bases in length. 

(iii) The primer probe should not amplify genomic DNA or transcripts or cDNA from 
5 related but biologically irrelevant loci. 

A suitable target of the selected primer probe is first strand cDNA, which may be 
prepared, in one embodiment, according to Example 1 below. In Example 1 1 below, we 
illustrate use of the primer probe with the first strand cDNA of Example 1 to permit 
measurement of constituents of a selected panel. 

10 It is envisaged that techniques in the art using microfluidics for example and highly 

sensitive markers will enable quantitation of RNA to occur directly from a single cell or 
lysed cell. This may rely on amplification of a marker but may not require amplification of 
the transcripts themselves. The amount of transcript measured for any particular locus is a 
data point or member of the first profile data set for a particular selected panel. 

15 According to embodiments of the invention, a first profile data set is derived from the 

sample, the first profile data set including a plurality of members, each member being a 
quantitative measure of the amount of a RNA transcribed from a gene locus, the gene locus 
being a constituent in a panel of constituents. A first profile data set may be obtained from a 
quantitative measure of the amount of a distinct RNA or protein corresponding to a gene 

20 locus. Each member of the profile data set should be reproducible within a range with 

respect to similar samples taken from the subject under similar conditions. For example, the 
profile data sets may be reproducible within one order of magnitude with respect to similar 
samples taken from the subject under similar conditions. More particularly, the members 
may be reproducible within 50%, more particularly reproducible within 20%, and sometimes 

25 even within 10% or less than within 3%. In addition, each member of the profile data set 
should be repeatable within a range with respect to multiple assays of aliquots of the same 
sample. For example, the profile data sets may be repeatable within one order of magnitude 
with respect to multiple assays of aliquots of the same sample. More particularly, the 
members may be repeatable within 50%,more particularly repeatable within 20%, and 

30 sometimes even within 10% or less than within 3%. 

In accordance with embodiments of the invention, a pattern of increasing, decreasing 

30 



2331/130 

and no change in relative gene expression from each of a plurality of gene loci examined in 
the precision panel may constitute profile sets that are informative with regards to a 
biological condition, biological efficacy of an agent, treatment conditions or for comparison 
to populations. Patterns of this nature may be used to identify likely candidates for a drug 
5 trial, used in combination with other clinical indicators to be diagnostic or prognostic with 
respect to a biological condition or may be used to guide the development of a 
pharmaceutical or nutraceutical through manufacture, testing and marketing. Such profile 
data sets may also be used to guide the allopathic or naturpathic treatment of the particular 
biological condition that the profile data set describes. 

10 The numerical data obtained from quantitative gene expression and numerical data 

from gene expression relative to a baseline profile data set may also be stored in databases or 
digital storage mediums and may be retrieved for purposes including managing patient health 
care or for conducting clinical trials or for characterizing a drug. The data may be 
transferred in networks via the World Wide Web , email, or an internet access site, for 

15 example, or by hard copy so as to be collected and pooled from distant geographic sites 

The figures provided here are directed to RNA. However, methods herein may also 
be applied using proteins where sensitive quantitative techniques, such as an Enzyme Linked 
Immunosorbent Assay (ELISA), or amplification of nucleic acid aptamers that bind 
specifically to gene expression products, as detailed in our co-pending application, U.S. 

20 Application Serial No. 09/595,720, filed June 16, 2000, which is hereby incorporated by 
reference. These methods and others are available and well-known in the art for measuring 
the amount of a protein constituent. 
Baseline profile data sets 

The analyses of samples from single individuals and from large groups of individuals 

25 provide a library of profile data sets relating to a particular panel or series of panels. These 
profile data sets may be stored as records in a library for use as baseline profile data sets, or 
may themselves describe a biological condition at the time and circumstances of the 
particular sample from which the profile data set is obtained, and be stored as records in a 
library for a particular biological condition for that particular sample. As the term "baseline" 

30 suggests, the stored baseline profile data sets serve as comparators for providing a calibrated 
profile data set that is informative about a biological condition or agent. Baseline profile 
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data sets may be stored in libraries and classified in a number of cross-referential ways. One 
form of classification may rely on the characteristics of the panels from which the data sets 
are derived. Another form of classification may be by particular biological condition. The 
concept of biological condition encompasses any state in which a cell or population of cells 
5 may be found at any one time. This state may reflect geography of samples, sex of subjects 
or any other discriminator. Some of the discriminators may overlap. The libraries may also 
be accessed for records associated with a single subject or particular clinical trial. The 
classification of baseline profile data sets may further be annotated with medical information 
about a particular subject, a medical condition, a particular agent etc. 

10 The choice of a baseline profile data set for creating a calibrated profile data set is 

related to the biological condition to be evaluated, monitored, or predicted, as well as, the 
intended use of the calibrated panel, e.g., as to monitor drug development, quality control or 
other uses. It may be desirable to access baseline profile data sets from the same subject for 
whom a first profile data set is obtained or from different subject at varying times, exposures 

15 to stimuli, drugs or complex compounds; or may be derived from like or dissimilar 
populations. 

The profile data set may arise from the same subject for which the first data set is 
obtained, where the sample is taken at a separate or similar time, a different or similar site or 
in a different or similar physiological condition. For example, Figure 5 provides a protocol 

20 in which the sample is taken before stimulation or after stimulation. The profile data set 

obtained from the unstimulated sample may serve as a baseline profile data set for the sample 
taken after stimulation, or it may describe a biological condition for a given subject or 
population of cells from which the sample was obtained at a particular time and under 
particular circumstances. The baseline data set may also be derived from a library containing 

25 profile data sets of a population of subjects having some defining characteristic or biological 
condition. The baseline profile data set may also correspond to some ex vivo or in vitro 
properties associated with an in vitro cell culture. The resultant calibrated profile data sets 
may then be stored as a record in a database or library (Figure 6) along with or separate from 
the baseline profile data base and optionally the first profile data set although the first profile 

30 data set would normally become incorporated into a baseline profile data set under suitable 
classification criteria. 
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Selected baseline profile data sets may be also be used as a standard by which to 
judge manufacturing lots in terms of efficacy, toxicity, etc. Where the effect of a therapeutic 
agent is being measured, the baseline data set may correspond to gene expression profiles 
taken before administration of the agent. Where quality control for a newly manufactured 
5 product is being determined, the baseline data set may correspond with a gold standard for 
that product. However, any suitable normalization techniques may be employed. For 
example, an average baseline profile data set is obtained from authentic material of a 
naturally grown herbal nutraceutical and compared over time and over different lots in order 
to demonstrate consistency, or lack of consistency, in lots of compounds prepared for release. 

10 Calibrated data 

A calibrated profile data set may be described as a function of a member of a first 
profile data set and a corresponding member of a baseline profile data set for a given gene 
locus in a panel. For example, calibrated profile data sets may be derived by calculating a 
ratio of the amount of RNA transcribed for a panel constituent in a cell sample in an 

15 environmental including intervention such as a therapeutic treatment or at a particular time 
(first profile data set) with respect to the amount of RNA transcribed for the same panel 
constituent in a cell that differs in some manner from the sample (baseline profile data set) 
(Figures 5 and 6). Given the precision we have achieved in measurement of gene expression, 
described above in connection with "selected panels" and "gene amplification", we conclude 

20 that where differences occur in measurement under such conditions, the differences are 

attributable to differences in biological condition. Thus we have found that calibrated profile 
data sets are highly reproducible in samples taken from the same individual under the same 
conditions. We have similarly found that calibrated profile data sets are reproducible in 
samples that are repeatedly tested. We have also found repeated instances wherein calibrated 

25 profile data sets obtained when samples from a subject are exposed ex vivo to a compound 
are comparable to calibrated profile data from a sample that has been exposed to a sample in 
vivo (Figure 14, Figure 16 (a), (b), and Figures 29(a) and 29(b)). We have also found, 
importantly, that an indicator cell line treated with an agent can in many cases provide 
calibrated profile data sets comparable to those obtained from in vivo or ex vivo populations 

30 of cells (Figure 15). Moreover, we have found that administering a sample from a subject 
onto indicator cells can provide informative calibrated profile data sets with respect to the 
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biological condition of the subject including the health, disease states, therapeutic 
interventions, aging or exposure to environmental stimuli or toxins of the subject (Figure 25). 

A use of a calibrated profile data set is to evaluate a biological condition of a subject. 
This may be for purposes of diagnosis or prognosis of a clinical disorder. It is desirable to 
5 obtain a calibrated data set that describes a state of health or alternatively a state of age or 
body mass or any condition or state that an individual subject might find themselves to be in. 
For example, the biological condition may relate to physical activity, conditioning or 
exercise, mental state, environmental factor such as medication, diet, or geography or 
exposure to radiation or environmental contamination or infectious agent, biological or 

10 environmental toxin. If health or conversely a clinical disorder is being evaluated, calibrated 
profiles data sets may be used for monitoring change in health status by periodic or regular 
comparison of profiles; the disorder may be a complex disease process possibly involving 
multiple gene including inflammation, autoimmune disease, degenerative disease, allergy, 
vascular disease, ischemia, developmental disease, hormonal conditions and infectious 

15 diseases. The clinical disorder may further include arthritis, asthma, multiple sclerosis and 
perimenopausal changes. The biological condition may affect a system of a subject 
including a respiratory, vascular, nervous, metabolic, urinary, reproductive, structural and 
immunological system or other metabolic state. The above examples of a biological 
condition are given by way of illustration and are not intended to be limiting. 

20 Similarly, calibrated profile data sets may be used to measure, monitor or predict the 

host response to an infectious agent for purposes of identifying the infectious agent, 
assessing the duration of infection, the extent of exposure or making therapeutic decisions. 

The evaluation of activity of an agent may require a series of calibrated profiles. It is 
here shown that calibrated profile data sets may be used to describe the biological activity of 

25 an agent that may be a single compound or a complex compound such as a nutraceutical or 
herbal. The agent may be assayed using indicator cells, ex vivo cell populations or by in vivo 
administration. These assays may rely on a series of signature panels or enlarged panels for 
different biological conditions. The resultant calibrated profiles may then be used to infer 
likely in vivo activity from the in vitro study. Insights into toxicity and mechanisms of action 

30 can also be inferred from calibration profile data sets. For example, the herbal Echinacea is 
believed to have both immunostimulatory and anti-inflammatory properties although neither 
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has been measured systematically. We have provided a systematic approach to investigate 
the biological activities of these and other herbs. We investigated the alleged 
immunostimulatory properties of the herbs by comparing the effect of treating the indicator 
cell line THP-1 or peripheral blood cells with the agent to untreated cells. Untreated cells 
5 include LPS stimulated untreated cells. Untreated cells were used as a baseline profile data 
set to measure the difference in gene expression between a baseline profile data set and the 
experimental treatment with the compound. Baseline profile data sets included a single 
sample or an average value from a series of experiments. The resultant calibrated profile 
data sets could then be compared with a library of calibrated profile data sets for a particular 

10 herb or/and libraries associated with different agents or conditions. 

From the information obtained about a previously undescribed agent, a signature 
panel may be derived optionally together with a signature profile to serve as a gold standard 
for testing other batches of the same agent. 
Calculation of calibrated profile data sets and computational aids 

15 The function relating the baseline and profile data sets is, in an embodiment of the 

invention, a ratio expressed as a logarithm. The calibrated profile data set may be expressed 
in a spreadsheet or represented graphically for example, in a bar chart or tabular form but 
may also be expressed in a three dimensional representation. The constituent may be 
itemized on the x-axis and the logarithmic scale may be on the y-axis. Members of a 

20 calibrated data set may be expressed as a positive value representing a relative enhancement 
of gene expression or as a negative value representing a relative reduction in gene expression 
with respect to the baseline. 

Each member of the calibrated profile data set should be reproducible within a range 
with respect to similar samples taken from the subject under similar conditions. For 

25 example, the calibrated profile data sets may be reproducible within one order of magnitude 
with respect to similar samples taken from the subject under similar conditions. More 
particularly, the members may be reproducible within 50%, more particularly reproducible 
within 20%, and sometimes even 10%. In accordance with embodiments of the invention, a 
pattern of increasing, decreasing and no change in relative gene expression from each of a 

30 plurality of gene loci examined in the precision panel may be used to prepare a calibrated 
profile set that is informative with regards to a biological condition, biological efficacy of an 
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agent treatment conditions or for comparison to populations. Patterns of this nature may be 
used to identify likely candidates for a drug trial, used in combination with other clinical 
indicators to be diagnostic or prognostic with respect to a biological condition or may be 
used to guide the development of a pharmaceutical or nutraceutical through manufacture, 
5 testing and marketing. 

The numerical data obtained from quantitative gene expression and numerical data 
from calibrated gene expression relative to a baseline profile data set may be stored in 
databases or digital storage mediums and may retrieved for purposes including managing 
patient health care or for conducting clinical trials or for characterizing a drug. The data may 
10 be transferred in networks via the World Wide Web, email, or internet access site for 
example or by hard copy so as to be collected and pooled from distant geographic sites 
(Figure 8). 

In an embodiment of the present invention, a descriptive record is stored in a single 
database or multiple databases where the stored data includes the raw gene expression data 
15 (first profile data set) prior to transformation by use of a baseline profile data set, as well as a 
record of the baseline profile data set used to generate the calibrated profile data set 
including for example, annotations regarding whether the baseline profile data set is derived 
from a particular signature panel and any other annotation that facilitates interpretation and 
use of the data. 

20 Because the data is in a universal format, data handling may readily be done with a 

computer. The data is organized so as to provide an output optionally corresponding to a 
graphical representation of a calibrated data set. 

For example, a distinct sample derived from a subject being at least one of RNA or 
protein may be denoted as Pi . The first profile data set consists of Mj where Mj is a 

25 quantitative measure of a distinct RNA or protein constituent. The record Ri is a ratio of M 
and P and may be annotated with additional data on the subject relating to for example, age, 
diet, ethnicity, gender, geographic location, medical disorder, mental disorder, medication, 
physical activity, body mass and environmental exposure. Moreover, data handling may 
further include accessing data from a second condition database which may contain 

30 additional medical data not presently held with the calibrated profile data sets. In this 
context, data access may be via a computer network. 
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The above described data storage on a computer may provide the information in a 
form that can be accessed by a user. Accordingly, the user may load the information onto a 
second access site including downloading the information. However, access may be 
restricted to users having a password or other security device so as to protect the medical 
5 records contained within. A feature of this embodiment of the invention is the ability of a 
user to add new or annotated records to the data set so the records become part of the 
biological information. 

The graphical representation of calibrated profile data sets pertaining to a product 
such as a drug provides an opportunity for standardizing a product by means of the calibrated 

10 profile, more particularly a signature profile. The profile may be used as a feature with 

which to demonstrate relative efficacy, differences in mechanisms of actions, etc. compared 
to other drugs approved for similar or different uses. 

The various embodiments of the invention may be also implemented as a computer 
program product for use with a computer system. The product may include program code for 

15 deriving a first profile data set and for producing calibrated profiles. Such implementation 
may include a series of computer instructions fixed either on a tangible medium, such as a 
computer readable medium (for example, a diskette, CD-ROM, ROM, or fixed disk), or 
transmittable to a computer system via a modem or other interface device, such as a 
communications adapter coupled to a network.. The network coupling may be for example, 

20 over optical or wired communications lines or via wireless techniques (for example, 

microwave, infrared or other transmission techniques) or some combination of these. The 
series of computer instructions preferably embodies all or part of the functionality previously 
described herein with respect to the system. Those skilled in the art should appreciate that 
such computer instructions can be written in a number of programming languages for use 

25 with many computer architectures or operating systems. Furthermore, such instructions may 
be stored in any memory device, such as semiconductor, magnetic, optical or other memory 
devices, and may be transmitted using any communications technology, such as optical, 
infrared, microwave, or other transmission technologies. It is expected that such a computer 
program product may be distributed as a removable medium with accompanying printed or 

30 electronic documentation (for example, shrink wrapped software), preloaded with a computer 
system (for example, on system ROM or fixed disk), or distributed from a server or 
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electronic bulletin board over a network (for example, the Internet or World Wide Web). In 
addition, a computer system is further provided including derivative modules for deriving a 
first data set and a calibration profile data set. 
Clinical trials 

5 The use of calibrated profile data sets for performing clinical trials is illustrated in 

Figure 10 using the above-described methods and procedures for running a clinical trial or 
managing patient care. Moreover, standardization between laboratories may be achieved by 
using a particular indicator cell line such as THP-1 which is stimulated by a known 
stimulator such as lipopolysaccharide so that resultant profile acts as a measure that the 
10 laboratory is performing the protocol correctly. Of course this is one single example, and 

other cells lines, tissues, or biological samples or combinations of the foregoing may be used 
as standards. 

A further embodiment of the invention provides a method for patient selection for 
augmenting clinical trials. Clinical trials in which candidate subjects are included or 

15 excluded according to a predetermined optimum calibrated profile for a given biological 

condition can result in more precise monitoring than would be otherwise possible. It can also 
result in a greater efficiency in clinical trial design because unsuitable patients that have for 
example complicating factors or conditions can be screened out. The calibrated profile data 
will also enhance the "signal to noise" by removing non-responders from clinical studies. 

20 The basic structure of a clinical trial design using gene expression profiling may follow any 
of several formats. These include testing body fluid from a candidate patient in the trial ex 
vivo against a new therapeutic agent and analyzing the calibrated profiles with respect to an 
agent-treated and placebo-treated samples using a predetermined selected panel and 
evaluating whether the candidate patient would be likely to respond without adverse effects 

25 to the composition being tested. In selected indications, profile data obtained from in vitro 
cell cultures or organ cultures may be desired where the cell originates from a target subject 
or from another subject or from an established cell line, or from a cell samples removed from 
the target subject where the cell samples may be obtained from any body fluid including a 
blood, urine, semen, amniotic, or a cerebrospinal fluid sample, or from a scraping from 

30 mucosal membranes such as from the buccal cavity, the eye, nose, vagina or by means of a 
biopsy including epithelial, liver, sternum marrow, testicular, or from tumor tissue removed 
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surgically from a tumor at any location. The above-described sources of samples are 
applicable to any medical use in which calibrated profile data sets are desired. 

In vitro dosage and toxicity studies using calibrated profile data sets obtained from 
indicator cell lines or samples of the patient tested ex vivo may provide useful information 
5 prior to initiation of the clinical trial and may significantly reduce the cost and time of a 
clinical trial while increasing the likelihood of identifying the presence of beneficial 
effect(s). In particular, the dose may be optimized on an individualized basis to maximize 
the impact on therapeutic outcome. For example, Figure 12 shows how ex vivo blood cells 
respond to the stimulatory effect of LPS and the subsequent treatment with an anti- 

10 inflammatory drug (methotrexate, meclofenamate or methylprednisolone). The data show 
how the effect of methotrexate and meclofenamate generates similar calibrated profile data 
sets where the baseline is LPS treated blood. In contrast, methylprednisolone has a 
substantially different effect from the other two compounds. A similar type of analysis can 
be performed with complex mixtures, as illustrated in Figure 21, in which the calibrated 

15 profiles obtained when Echinacea, Arnica and Siberian Ginseng applied to LPS stimulated 
blood ex vivo are compared. In this example, all three agents appear to act differently from 
each other with respect to a sample from a single subject. Similar analyses can be used to 
compare compounds with unknown targets or activities or metabolic patterns to compounds, 
complex or simple, with known or pre-determined profiles. 

20 The above methods and procedures may be utilized in the design and running of 

clinical trials or as a supplemental tool. Moreover, the above methods and procedures may 
be used to monitor the patients' health as well as the patient's responsiveness to an agent 
before during and after the clinical trial. This includes monitoring whether multiple agents 
interfere with each other, act synergistically or additively or are toxic or neural with respect 

25 to each other. This type of information is very important as individuals take an increasing 
number of medications. 

Similarly, the methods and procedures described above may be used to manage 
patient care for an individual or a population. Such methods and procedures may also be 
used to develop a regional or global research network that uses calibrated profile data sets 

30 and the resulting databases to conduct research or trials. 

Both the calibration profile data sets in graphical form and the associated databases 
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together with information extracted from both are commodities that can be sold together or 
separately for a variety of purposes. For example, graphic representations of calibration 
profile data sets may provide a description of a product with respect to its activity that may 
be used to promote the product. Alternatively, the graphical form of the calibrated profile 

5 data sets and access to baseline profile databases provide a means for manufacturers to test 
discrete batches of product against a gold standard. 

The data may be used strategically for design of clinical trials. It may also be useful 
for physicians practicing at remote sites to offer personalized healthcare to a patient. 
Accordingly, the physician may set up personalized databases for calibrated profile data sets 

10 prior to and after treatment of a particular condition. New data on the subject could be added 
to the personalized database at each visit to the doctor. The data may be generated at remote 
sites by the use of kits that permit a physician to obtain a first profile data set on a sample 
from a patient. For remote users to access the site, it is envisaged that secured access to the 
global network containing libraries of baseline profile data sets and calibrated profile data 

15 sets, classified by particular criteria and representing data from larger populations than a 
single individual, would be necessary. The access to the global database may be password 
protected thereby protecting the database from corrupted records and safeguarding personal 
medical data. The graphical form provided by the calibrated data sets may be used to create 
catalogs of compounds in a pharmacopiae complete with toxic effects that might arise for 

20 particular individuals as well as other types of drug interactions. 

Access to the global database may include the option to load selected data onto a 
second access site. This process may include downloading the information to whatever site 
is desired by the user and could include securing hard copies of information. It is desirable 
to control how and what data is offloaded or copied to maintain the integrity of the database. 

25 It is envisaged that while a global network of clinical data would be an informational 
resource, it would have utility in conducting research that may include epidemiological 
studies and studies concerning the mechanism of action of an agent, as well as studies 
concerning the nature of interpersonal variability as determined by calibrated profile data 
sets. 

30 Examples of Medical uses 

(a) Early detection of infectious diseases: Markers or surrogate markers from 
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mice may be obtained for measuring gene expression in humans that indicate early or 
immediate response to infection, for example, to a virus such as hepatitis virus, or to a 
bacterium such as Mycobacterium tuberculosis (the etiologic agent of tuberculosis) (see 
Figure 4). Candidate genes are identified and changes in expression of those genes in the 
5 presence of a challenge provide a set of markers. The set of markers can combine markers 
encoded by the genome of the subject and one more distinctive markers encoded by the 
genome of the infectious agent. For example, changes in expression of an immediate early 
gene of a virus, e.g. a gene encoding an enzyme of viral replication, and a host gene such as 
the gene for any or all of IL-2, IL-4 and IL-5, may comprise markers or surrogate markers 

10 for a medical condition capable of detecting that condition prior to the onset of medical 
symptoms. This method may afford earlier detection of an infection than is possible using 
current diagnostic techniques. 

(b) Toxicity profiles and mechanistic profiles obtained from an in vitro assay and 
in vivo assays. Toxicity and mechanistic information arising from the administration of a 

15 compound to a population of cells may be monitored using calibrated profile data sets. The 
following is an example of an experimental protocol for obtaining this information. Firstly, 
an experimental group is established: (1) control cells maintained without therapeutic agent 
and without stimulus; (2) cells treated with therapeutic agent but without stimulus; (3) cells 
without therapeutic agent but with stimulus, (4) sample with therapeutic agent and with 

20 stimulus. The population of cells can be selected from primary cell cultures prepared in 
culture plates using methods well established in the art; or mature differentiated cell 
preparation from whole blood or isolated monocytes from the target organism. 

The cells are stimulated so as to present a targeted physiological condition by 
pretreatment with LPS purified from a Gram-negative bacterium (a variety of LPS 

25 preparations from pathogenic bacteria, for example, from Salmonella typhimurium and from 
Escherichia coli 01 157:H7, are available from Sigma, St. Louis, MO). The therapeutic 
agent administered to the cell samples in this example is an inhibitor of an enzyme known to 
be key in disease etiology, namely an inhibitor of a protease or a nucleic acid polymerase. 
Following treatment by addition of the therapeutic agent and further incubation for four to 

30 six hours, samples of the cells are harvested and analyzed for gene expression. Nucleic acid, 
specifically mRNA, can be prepared from the sample by methods known to one or ordinary 
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skill in the art (see, for example, the Lyse-N-Go™ reagent, Pierce Chem. Co., Rockford, IL). 
Samples are analyzed by QPCR according to a quantitative replicative procedure, (for 
example, quantitative polymerase chain reaction procedure (QPCR)) (see, for example, 
Gibson,U. 1996 Genome Res.6:995-1001, and references cited therein). Total RNA was 
5 assessed using universal primers. Toxicity of the agent for cells can be measured in 
untreated cells by vital stain uptake, rate of DNA synthesis (autoradiography of labeled 
nucleic compared to cells stained), stain by DNA-specific eyes (Hoechst), etc. Mechanistic 
profiles can be determined by analysis of the identifies of de novo up- or down-regulated 
genes. Further, in the presence of a therapeutic agent, some genes are not expressed or 

10 differentially expressed, indicating potential efficacy of the therapeutic agent in suppressing 
the effects of stimulation by the LPS. For example, in Figure 21, levels of ICE that are 
somewhat stimulated in the presence of LPS +Echinacea are substantially depressed by LPS 
+ Arnica relative to LPS stimulated cells absent agent. Levels of HSP 70 which are 
depressed in the presence of LPS+Echinacea are substantially stimulated in the presence of 

15 LPS +Arnica, and LPS + Siberian Ginseng relative to LPS stimulated cells absent the 

addition of an agent. Levels of IL-12p40 which are slightly increased in the presence of LPS 
+ Echinacea are substantially depressed in the presence of LPS + Arnica and LPS +Siberian 
Ginseng relative to LPS stimulation. Similarly, Figure 16 shows a much enhanced reduction 
of gene expression in whole blood for IL-la, Il-lp, H-7, 11-10, IL-DL-15, IFN-y, TGF-p, 

20 TNF-p cox-2, and ICAM in the presence of prednisolone +LPS when compared to arnica 
+LPS or nothing +LPS. 

(c) Quantitation of gene expression in a blood cell to predict toxicity in another 
tissue or organ. 

Leukocytes, for example, may be obtained from a blood sample of a subject, for the 
25 purpose of assessing the appearance of a pathological condition in another organ, for 

example, the liver. A profile data set is obtained of genes expressed in the leukocytes, for 
example, genes encoding a set of lymphokines and cytokines. The data set is compared to 
that of the database, to examine correlations, for example to other subjects, and to the subject 
prior to administration of a therapeutic agent. 
30 By this method, a correlation can be drawn between, for example, administration of 

acetaminophen (Tylenol) and sensitivity to this therapeutic agent and manifested by liver 
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damage. An early prediction of therapeutic agent sensitivity, detected prior to the onset of 
actual damage to the liver, may be clinically available so that the subject receives no further 
administration of acetaminophen. The database may be used to detect a correlation or 
correlations prior to the onset of traditional medical assessments, such as increase in bilirubin 
5 level or other indication of liver pathology. 

(d) Calibrated profiles from blood cells for prognosis of severity and prediction of 
adverse reactions in treatment of an autoimmune disease. 

The probability and timing of onset of symptoms of an autoimmune disease, for 
example, rheumatoid arthritis, may be monitored by appearance of expression of markers or 

10 surrogate markers as determined by the methods of gene expression profiling of markers or 
surrogate markers and comparison to a profile database as described above. Thus an 
indication of onset may be obtained, and advance management by utilization of preventive 
measures to forestall onset, can be taken. Further, the user may choose a set of potential 
therapeutic agents, and assess for a given agent, the probability that a subject will present an 

15 adverse reaction if given a full course of treatment, prior to that full course. For example, 
using embodiments of the invention, a single dose or a few doses of the agent methotrexate 
may be administered to a subject having arthritis and in need of a therapeutic agent. If the 
gene expression profile data set of the subject in response to the short course of methotrexate 
correlates with data sets from subjects having adverse reactions to this agent, then 

20 administration of a full course of methotrexate is counterindicated. Conversely, if the gene 
expression profile data set correlates with those of subjects who have responded positively to 
administration of a course of methotrexate treatment, then this therapeutic agent can be 
administered to the subject with much lower probability of adverse reaction. 
Discussion of Figures 

25 Figures 1-4 illustrate some of the applications of calibrated profile data sets. In 

Figure 1, three possible scenarios are provided. Firstly, a candidate therapeutic agent may be 
tested to determine its molecular pharmacology and toxicology profiles. The test might 
include obtaining calibrated profile data sets for a series of selected panels selected on the 
basis of what activity is predicted for the drug. The population of cells exposed to the agent 

30 may be the result of in vivo administration as depicted by the mouse or direct exposure in 
vitro where the cells may be an indicator cell line or an ex vivo sample from the subject. The 
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result of the screen is the identification of more effective drug candidates for testing in 
human subjects. 

The second scenario in Figure 1 is the use of calibrated profile data sets to identify a 
suitable clinical population for screening a potential therapeutic agent. Both demonstration of 
5 lack of toxicity and demonstration of clinical efficacy require certain assumptions about the 
clinical population. The calibrated profile data sets provide a means for establishing those 
assumptions with respect to the biological condition of the individuals selected for the 
clinical trials. 

The third scenario in Figure 1 involves the practice of individualized medicine, which 
10 may include creating an archive of calibrate profile data sets on the individual in a state of 
health such that changes can be identified using signature panels so as to permit evaluation, 
prognosis, or diagnosis of a particular condition. Moreover, stored information about the 
patient in the form of calibrated profile data sets permits selecting one of a group of possible 
therapeutic agents most likely to be effective for the patient, optimizing dosage of drug, and 
15 detecting adverse effects that might arise through drug-drug interactions before symptoms 
arise. Use of calibrated profile data sets may provide more efficient and cost-effective health 
care management. 

The novel approach described above for evaluating a biological condition of a subject 
may be applied to an ex vivo or in vitro assay for measuring the effect of an agent on a 

20 biological condition as illustrated in Figures 2-4. A sample from the patient may measured 
directly ex vivo or tested ex vivo against an agent to predict an effect in the patient. This 
provides a quick and effective way to determine which drug, chosen from within a single 
class of drugs that all may be used to treat a particular condition, may be most effective for a 
given subject. Alternatively, an agent may be tested on an indicator cell line that can provide 

25 a quantitative measure of therapeutic performance in a class of individuals. 

Figure 2 illustrates how calibrated profile data sets may assist in screening a library of 
candidate compounds to discover candidate drugs. Starting with for example, 500 candidate 
drugs, these can be tested in indicator cells or ex vivo body fluid or tissues against signature 
panels for in vitro toxicology or metabolic indicators. The figure illustrates the large number 

30 of compounds that entered in late stages in the development process only to ultimately be 
rejected due to adverse biological interactions. Use of calibrated profile data sets may in 
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many instances more readily identify likely successful candidates and thereby reduce the 
expense and untoward effects of animal and human experimentation for compounds that 
could have been predicted to fail. 

Figure 3 illustrates how a compound may be administered to an experimental animal 
5 such as a mouse or to an indicator cell line. The in vivo or ex vivo or indicator cell sample 
may further be treated with a stimulus. The result of both the compound and the stimulus 
may then be detected, for example, using signature profiles for toxicity or for mechanism to 
compare the effect of no drug +/- stimulus or +/-drug and no stimulus. Both in vitro (left 
panel of Figure 3) and in vivo (right panel of Figure 3) studies can be used to evaluate the 

10 effect of a compound (drug, nutraceutical, environmental stimuli, etc.). The right hand panel 
also illustrates the specific embodiment of an "in vitro clinical trial", that is, treatment of 
cells obtained from a subject and treated with a compound (with or without a stimulus) in 
vitro (or ex vivo) in order to predict the outcome of similar treatment of the subject in vivo 
(see Fig. 15 for a specific example). The output from both panels is described as toxicity and 

15 mechanistic profiles. Either experimental course may be used to both evaluate potential 
toxicity, e.g., using the toxicity, or liver metabolism selected panels, and to determine or 
confirm likely mechanism of action by a critical selection of a gene panel(s) that illustrates 
and differentiates molecular mechanisms of action (see Figure 12 for a specific example). 
These are merely examples, and other selected panels may be employed to evaluate or 

20 characterize other biological effects or conditions. Figure 4 illustrates a bioassay in which 
cells are removed from the subject and tested ex vivo with the addition of a compound and 
also a challenge or stimulus. The ex vivo effect of stimulus and then drug on whole blood 
taken from a human subject is shown in Figure 12 in which the stimulus is 
lipopolysaccharide (an inflammatory agent) while the drug is any of methotrexate, 

25 meclofenamate or methylprednisolone using a signature panel for inflammation. 

Methylprednisolone, a drug commonly used in the treatment of acute exacerbations of COPD 
as well as in the chronic management of this disease, is considered to be a potent by non- 
specific anti-inflammatory agent. However, as demonstrated in Figure 22, its effects on gene 
expression are dependent on the stimulus. While there are general qualitative similarities 

30 between the effects on gene expression across these three stimuli, there are both quantitative 
and qualitative differences that may be important in understanding when glucocorticoid 
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intervention is warranted. 

According to embodiments of the invention, an indicator cell population is used to 
measure quantitative gene expression the effect of an agent or a biological sample may 
influence the choice of which indicator cell line will be most informative. For example, a 
5 cloned cell line such as THP-1 or a primary cell population (peripheral mononuclear cells) 
may provide information that is comparable to that obtained from a body sample directly (see 
Figure 15). The normal state of gene expression may range from zero or few transcripts to 

10 5 or more transcripts. 

Similarly, an agent may be evaluated for its effect on any population of cells, either in 

10 vivo, ex vivo or in vitro, by administering the agent and then determining a calibrate profile 
data set for those cells under the selected conditions. Examples of this approach are provided 
in Figures 10-16 and 18. Figure 18 further provides calibrated profile data sets for different 
concentrations of a single agent showing that the transcription of selected constituents vary 
with dose and therefore the anticipated effectiveness with respect to the biological condition. 

15 The above description of determining a biological condition is exemplified as 

follows: the action of a pharmaceutical or nutraceutical is measured with respect to its anti- 
inflammatory properties. The measurement of the effect may be established using a selected 
panel of constituent gene loci for example, an inflammation selected panel, including, 
Interleukin 1 alpha (DL-loc) or Tumor Necrosis Factor alpha (TNF-oc). The anti- 

20 inflammatory effect may first be established by treating indicator cells or sample cells ex vivo 
with a known inflammation inducers (for example, lipopolysaccharide or other mitogens) 
followed by treatment with the experimental agent or condition expected to affect the 
expression from the appropriate gene loci. Accordingly, a baseline profile data set may be 
established in this case as the gene expression for a particular panel of constituents resulting 

25 in the presence of the inflammation inducer. The addition of a potential anti-inflammatory 
agent results in a change relative to the baseline. This approach is illustrated for example in 
Figure 12. Methylprednisolone has a substantial down regulation effect on IL-2 in blood 
cells stimulated ex vivo with LPS where the baseline data set is LPS stimulated cells. In this 
case the effect is shown as negative. In contrast, as shown in Figure 16b, IL-2 appears to be 

30 upregulated in whole blood not previously exposed to LPS, where the baseline data set is 
unstimulated cells. These results are consistent with the observation that 
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methylprednisolone stimulates IL-2 production. 

The determination of the biological condition of a subject may include measuring and 
storing additional data about the subject. For example, if the subject is a human or 
mammalian patient, additional clinical indicators may be determined from blood chemistry, 
5 urinalysis, X-ray, other chemical assays and physical or sociological findings. 

Figure 7 illustrates how the accumulation of calibrated profile data sets may improve 
the predictive power of the database and thereby increase its value in generating information 
about a biological condition or agent. The figure indicates the use of the database in terms of 
its power, for example, to predict the course of a therapeutic intervention or follow the 

10 course of an individual subject compared to a population. Information from the database may 
be used to predict a likely mechanism of metabolism or molecular mechanism of action, and 
to compare a single profile to a collection of signature, calibrated selected profiles. 

Use of a database in accordance with an embodiment of the present invention is 
illustrated in Figure 8. Figure 8 illustrates a data profile set from the database. Entries for 

15 input include a name, an Experimental Type, and whether the entry is a New Reference; 
Cell/Tissue/Species and whether these are new; Therapeutic agent (compound), Dose, and 
additional parameters and whether the therapeutic agent is new. Observations are recorded 
according to the identity of a Gene (New Gene) and a Protein (New Protein). The Stimulus 
or other Treatment, if any, and the Dose are entered. Gene (and/or Protein) Expression, 

20 Expression Value, Expression Units if appropriate and Expression Time are shown. The 
figure specifically illustrates the range of applicable fields of investigation from complex 
natural products to clinical trails in humans, linkage to traditional forms of measurement and 
evaluation such as literature citations, clinical indicators and traditional pharmacokinetic 
measurements. Expert analysis of the selected profile data contained in the database may 

25 then be used to guide product development and marketing, or used to improve the clinical 
decision making concerning the health of a single individual or population of individuals. 

One form of record may provide information about a subject or agent with respect to 
identity, medical history including traditional pharmaceutical/medical data, clinical 
indications as determined from literature data, reference to additional types of analysis in the 

30 database, etc. 

Figure 9 shows an embodiment of the present invention in which profile data is 
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evaluated using data from a database that is remotely accessed over a network. Using the 
approach of this figure data may be derived at one or more locations (such as location 1 
shown here), compared using information retrieved over communication path 1 109 from a 
central database at location 2, and the result of the comparison may be used to affect, for 
5 example, the course of treatment of an individual or population. The communication path 
1 109 between location 1 and location 2 is two-way, so that information resulting from 
determinations made at location 1 may delivered over the path 1 109 to update the database 

1 108. The consequence is an iterative process whereby the information from database is used 
in a determination that may affect the course of treatment, evaluation, or development, and 

10 the results of the determination become part of the database. In a first location, as in Figure 
5, from a tissue sample procured in process 1 101, there are derived multiple RNA species 
pursuant to process 1 102, and then in process 1 103, profile data are quantified to produce a 
profile data set that is pertinent to the tissue sample obtained in process 1101. In order to 
evaluate the profile data set, in process 1 104 information is retrieved from database 1 108, 

15 which is located in a second location. In fact the database may be in communication with a 
large number of locations, each of which is generating profile data that must be evaluated. 
The retrieval of information from the database is accomplished over a communication path 

1 109, which may include a network such as the Internet,, in a manner known in the art. Once 
information has been obtained from the database 1 108, the information is used in evaluating 

20 the quantified profile data in process 1 105, with the result in process 1 106 that the medical 
condition of the subject may be assessed. In process 1 107, the database 1 108 is updated over 
the communication path 1 109 to reflect the profile data that have been quantified in process 
1 103. In this manner the database 1 108 may be updated to reflect the profile data obtained 
over all locations, and each location has the benefit of the data obtained from all of the 

25 locations. While, for simplicity, all of the processes in Figure 9 are shown as taking place at 
location 1, some or even all of the processes may be implemented elsewhere , for example 
location 2, or in multiple locations. At location 2, associated with the database, for example, 
may be a server that is used for hosting these processes, including evaluation of the 
quantified profile data. 

30 
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Example 1. 

(a) Use of whole blood for ex vivo assessment of a biological condition affected 
by an agent. 

Human blood is obtained by venipuncture and prepared for assay by aliquotting 
5 samples for baseline, no stimulus, and stimulus with sufficient volume for at least three time 
points. Typical stimuli include lipopolysaccharide (LPS), phytohemagglutinin (PHA) and 
heat-killed staphylococci (HKS) or carrageenan and may be used individually (typically) or 
in combination. The aliquots of heparinized, whole blood are mixed without stimulus and 
held at 37°C in an atmosphere of 5% CO2 for 30 minutes. Stimulus is added at varying 

10 concentrations, mixed and held loosely capped at 37°C for 30 min. Additional test 

compounds may be added at this point and held for varying times depending on the expected 
pharmacokinetics of the test compound. At defined times, cells are collected by 
centrifugation, the plasma removed and RNA extracted by various standard means. 

Nucleic acids, RNA and or DNA are purified from cells, tissues or fluids of the test 

15 population or indicator cell lines. RNA is preferentially obtained from the nucleic acid mix 
using a variety of standard procedures (or RNA Isolation Strategies, pp. 55-104, in RNA 
Methodologies, A laboratory guide for isolation and characterization , 2nd edition, 1998, 
Robert E. Farrell, Jr., Ed., Academic Press); in the present use using a filter-based RNA 

TM 

isolation system from Ambion (RNAqueous , Phenol-free Total RNA Isolation Kit, 

20 Catalog #1912, version 9908; Austin, Texas). 

In accordance with one procedure, the whole blood assay for selected profiles 
determination was carried out as follows: Human whole blood was drawn into 10 mL 
Vacutainer tubes with Sodium Heparin. Blood samples were mixed by gently inverting tubes 
4-5 times. The blood was used within 10-15 minutes of draw. In the experiments, blood was 

25 diluted 2-fold, i.e. per sample per time point, 0.6 mL whole blood + 0.6 mL stimulus. The 
assay medium was prepared and the stimulus added as appropriate. 

A quantity (0.6 mL) of whole blood was then added into each 12 x 75 mm 
polypropylene tube. 0.6 mL of 2X LPS (from E.coli serotye 0127:B8 ,Sigma#L3880 or 
serotype 055, Sigma #L4005, lOng/ml, subject to change in different lots) into LPS tubes 

30 was added. Next, 0.6 mL assay medium was added to the "control" tubes with duplicate 

tubes for each condition. The caps were closed tightly. The tubes were inverted 2-3 times to 
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mix samples. Caps were loosened to first stop and the tubes incubated @ 37°C, 5% C02 for 
6 hours. At 6 hours, samples were gently mixed to resuspend blood cells, and 1 mL was 
removed from each tube (using a micropipettor with barrier tip), and transferred to a 2 mL 
"dolphin" microfuge tube (Costar #3213). 
5 The samples were then centrifuged for 5 min at 500 x g, ambient temperature (IEC 

centrifuge or equivalent, in microfuge tube adapters in swinging bucket), 
and as much serum from each tube was removed as possible and discarded. Cell pellets were 
placed on ice; and RNA extracted as soon as possible using an Ambion RNAqueous kit 

10 (b) Amplification strategies. 

Specific RNAs are amplified using message specific primers or random primers. The 
specific primers are synthesized from data obtained from public databases (e.g., Unigene, 
National Center for Biotechnology Information, National Library of Medicine, Bethesda, 
MD), including information from genomic and cDNA libraries obtained from humans and 

15 other animals. Primers are chosen to preferentially amplify from specific RNAs obtained 
from the test or indicator samples, see, for example, RT PCR, Chapter 15 in RNA 
Methodologies, A laboratory guide for isolation and characterization , 2nd edition, 
1998,Robert E. Farrell, Jr., Ed., Academic Press; or Chapter 22 pp.143-151, RNA isolation 
and characterization protocols . Methods in molecular biology, Volume 86, 1998, R. Rapley 

20 and D. L. Manning Eds., Human Press, or 14 in Statistical refinement of primer design 
parameters, Chapter 5, pp.55-72, PCR applications: protocols for functional genomics, 
M.A.Innis, D.H. Gelfand and JJ. Sninsky, Eds., 1999, Academic Press). Amplifications are 
carried out in either isothermic conditions or using a thermal cycler (for example, a ABI 
9600 or 9700 or 7700 obtained from Applied Biosystems, Foster City, CA; see Nucleic acid 

25 detection methods, pp. 1-24, in Molecular methods for virus detection , D.L.Wiedbrauk and 
D.H., Farkas, Eds., 1995, Academic Press). Amplified nucleic acids are detected using 

TM 

fluorescent-tagged detection primers (see, for example, Taqman PCR Reagent Kit, 
Protocol, part number 402823 revision A, 1996, Applied Biosystems, Foster City CA.) that 
are identified and synthesized from publicly known databases as described for the 
30 amplification primers. In the present case, amplified DNA is detected and quantified using 
the ABI Prism 7700 Sequence Detection System obtained from Applied Biosystems (Foster 
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City, CA). Amounts of specific RNAs contained in the test sample or obtained from the 
indicator cell lines can be related to the relative quantity of fluorescence observed (see for 
example, Advances in quantitative PCR technology: 5' nuclease assays, Y.S. Lie and C J. 
Petropolus, Current Opinion in Biotechnology, 1998, 9:43-48, or Rapid thermal cycling and 
5 PCR kinetics, pp. 21 1-229, chapter 14 in PCR applications: protocols for functional 
genomics, M.A.Innis, D.H. Gelfand and J J. Sninsky, Eds., 1999, Academic Press. 

As a particular implementation of the approach described here, we describe in detail a 
procedure for synthesis of first strand cDNA for use in PCR. This procedure can be used for 
both whole blood RNA and RNA extracted from cultured cells (i.e. THP-1 cells). 
10 Materials 

(l)Applied Biosystems TAQMAN Reverse Transcription Reagents Kit (P/N 808- 
0234). Kit Components: 10X TaqMan RT Buffer, 25 mM Magnesium chloride, deoxyNTPs 
mixture, Random Hexamers, RNase Inhibitor, MultiScribe Reverse Transcriptase (50 U/mL) 
(2) RNase / DNase free water (DEPC Treated Water from Ambion (P/N 9915G), or 
15 equivalent) 

Methods 

1 Place RNase Inhibitor and MultiScribe Reverse Transcriptase on ice immediately. 
All other reagents can be thawed at room temperature and then placed on ice. 

2 Remove RNA samples from -80°C freezer and thaw at room temperature and then 
20 place immediately on ice. 

3 Prepare the following cocktail of Reverse Transcriptase Reagents for each 100 joL 
RT reaction (for multiple samples, prepare extra cocktail to allow for pipetting error): 

1 reaction(|aL) 1 IX, e.g. 10 samples(jxL) 





10XRT Buffer 


10.0 


110.0 


25 


25 mM MgC12 


22.0 


242.0 




dNTPs 


20.0 


220.0 




Random Hexamers 


5.0 


55.0 




RNAse Inhibitor 


2.0 


22.0 




Reverse Transcriptase 


2.5 


27.5 


30 


Water 


18.5 


203.5 




Total: 


80.0 


880.0 (80 uL per sample) 
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4 Bring each RNA sample to a total volume of 20 \ih in a 1.5 mL 
microcentrifuge tube (for example, for THP-1 RNA, remove 10 jiL RNA and dilute to 20 jxL 
with RNase / DNase free water. . .for whole blood RNA use 20 total RNA) and add 80 \iL 
RT reaction mix from step 5.2.3. Mix by pipetting up and down. 
5 5 Incubate sample at room temperature for 10 minutes. 

6 Incubate sample at 37°C for 1 hour. 

7 Incubate sample at 90°C for 10 minutes. 

8 Quick spin samples in microcentrifuge. 

9 Place sample on ice if doing PCR immediately, otherwise store sample at - 
10 20°C for future use. 

10 PCR QC should be run on all RT samples using 18S and p-actin (see SOP 
200-020). 

Example 2 . Different inflammatory stimuli give rise to different, baseline profile 

15 data sets so that the calibrated selected profiles for different agents in the same class of anti- 
inflammatory result in different signature profiles. 

Figures 1 1(a) and 1 1(b) show different inflammatory stimuli give rise to different, 
baseline profile data sets that may be used in determining the calibrated selected profile data 
sets for the three anti-inflammatory agents tested, and the resulting different signature 

20 profiles. The different profiles reflect the difference in the molecular targets and 

mechanisms of action of the three agents derived from a single class of therapeutics, anti- 
inflammatory agents. Figure 1 1(a) also illustrate the extraordinary range of detection (y- 
axis) from less than 10 fold difference from the calibrated profile with respect to some 
constituents to a change of 10 (10E13) in gene expression of one constituent (indeed the 

25 change for a constituent in Figure 1 1(b) is 10' 22 ) when compared to the calibrator. 

Comparison to the calibrator results in gene expression profiles that are increased, decreased, 
or without change from the calibrated set. 

Figure 1 1(a) shows relative gene expression (mRNA synthesis) in heat-killed 
staphylococci (HKS)-stimulated cells, and the effect of three different compounds (TPCK, 

30 UT-77, and "Dex", or dexamethasone). Compound TPCK caused a 10-fold decrease in 

relative IFN-y expression, and 100,000-fold decreases in IL-4 and IL-5 expression. Further, 
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compound UT-77 caused even greater magnitude of increases in relative expression of the 
gene encoding IL-5, and more modest increases in IL-1 expression (more than 10-fold) and 
IFN-y. Such effects can be highly significant in disease etiologies and outcomes, and have 
predictive value concerning the usefulness as therapeutic agents of these compounds or 
5 similar chemical entities or chemicals that act similarly. HKS cells may be used as an in 
vitro model of Gram-positive bacterial infection. 

Fig. 11(b) displays analyses of expression of the 12 genes in lipopolysaccharide- 
(LPS)-treated cells, an in vitro model of Gram-negative bacterial infection. These data 
include several striking contrasts to the data in Fig. 1 1(a). Thus treatment with the 

10 therapeutic agent Dex caused a striking decrease in expression of the IL-2 gene in LPS- 

treated cells, and a striking increase in IL-2 expression in HKS-treated cells. Strikingly large 
differences in gene expression in the differently stimulated cells can be seen for the IL-4 and 
the IL-5 genes. Expression of the gene for IFN, in contrast, responded similarly in cells 
treated by either of the stimuli and any of the therapeutic agents. 

15 By these criteria, expression of the genes for IL-2, IL-4 and IL-5 were observed to be 

candidate markers or surrogate markers in cell model systems to distinguish responses of the 
cells to Gram-positive and Gram-negative bacterial infection. 

Example 3. A single therapeutic agent for treating a particular condition can be 
differentiated from a second therapeutic agent that also treats the particular condition by a 

20 signature profile for a given selected panel of gene loci. 

Figure 12 shows a calibrated profile data set for a panel having 8 constituents that are 
indicative of a biological condition that includes inflammation. The profiles are shown for 
three different anti-inflammatory agents-methotrexate, meclofenamate and 
methylprednisolone. The calibrated profile data sets for each agent as shown represents a 

25 signature profile for that agent. This signature profile may serve as a device for establishing 
quality control for a batch of the agent. Indeed, it is envisaged that compounds or classes of 
compounds on the market or in development may be characterized by a signature profile. 
The signature profile may be represented in a graphical format, more particularly as a bar 
graph as provided in Figure 12. For Figure 12, an ex vivo sample was tested. A sample of 

30 blood was taken from the subject. Aliquots of the sample were subjected to 

lipopolysaccharide (LPS) ex vivo. After 30 minutes, the anti-inflammatory agent as indicated 
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was added to an aliquot of the sample of blood and after about another 4 hours, the 
expression of the panel of genes (D-la, 11-2, 11-8, 11-10, Il-12p35, H-12p40, IL-15, IFN- 
Gamma and TNF-a) was determined. Although the calibrated profile of methotrexate and 
meclofenamate were similar, the calibrated profile of methylprednisolone was substantially 
5 different. Differences may be reflective of the differences of the mechanisms or target(s) of 
action of this agent within the general class of anti-inflammatory compounds. The baseline 
is the profile data set for lipopolysaccharide absent any additional agents. 

Example 4 . There is relatively low variability with respect to the profile within a 
single individual over time when the calibrated selected profile is determined from the 
10 measurement of gene expression across many gene loci that have been appropriately 
induced. 

Figures 13(a), 13(b), and 13(c) show graphical representations of calibrated selected 
profile data sets for two different samples of whole blood. Heparinized whole blood from a 
single normal healthy volunteer was collected on two separate occasions of more than 2 

15 weeks apart. Figure 13a, for sample 991 1 16, and Figure 13b, for sample 991028, reflect the 
biological condition of the tested cells from the single donor using a selected panel (i.e., the 
inflammation selected panel) of 24 members, in response to stimulation with one of three 
different agents. The baseline in this example is derived from untreated cells obtained from 
the same individual. The calibrated profiles are shown for cells exposed for 4 to 6 hours to 

20 lipopolysaccharide (LPS), heat-killed Stapylococci (HKS), and phytohemagglutinin (PHA). 
Figure 13c shows a direct comparison of LPS-stimulated blood sample 991 16 with respect to 
blood sample 991028, i.e., 991028 is used as the calibrator or baseline profile data set. The 
messenger RNA levels measured on 10/28/99 were used to compare the levels of messenger 
RNA measured on 1 1/16/99. A perfect identity of RNA levels would be represented by a flat 

25 line at unity. These data show that for baseline gene expression, there can be as much as an 
8 fold difference (c-jun) in messenger RNA levels. However, for most of the genes 
measured, when there is no known substantial physiological change in the subject, the levels 
of messenger RNA measured on one day are similar to those measured on a different day. 
Changes in gene expression, whether mRNA or protein, in excess of 10-20% may be 

30 reflective of biological changes in the subject even though traditional clinical measurements 
may not identify such changes. Figure 13(d) is similar to Figure 13(c) except that the cells 
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were not stimulated with LPS. 

Figures 13(a) through 13(d) document the relatively low variability with respect to 
the profile within a single individual over time in similar physiological conditions when the 
calibrated selected profile is determined from the measurement of gene expression across 
5 many gene loci that have been appropriately induced. The figures illustrate (1) the class- 
specific effects (generally inflammatory as determined by the effect on pro-inflammatory 
gene loci, e.g. TNF-alpha, IL-1 alpha and IL-1 beta ), (2) the agent-specific effects 
quantitative differences between each of the agents at the same gene loci (e.g., IL-2) and (3) 
reproducible and therefore predictable effects on the subject population, TK (Figure 13c). 

10 Example 5 . Similarities and differences in the effect of a single agent on cell 

populations differing in their biological condition. 

Ex-vivo gene expression analysis can be performed by obtaining the blood of a 
subject for example by drawing the blood into a vacutainer tube with sodium heparin as an 
anticoagulant. An anti-inflammatory such as 3-methyl-prednisolone at a final concentration 

15 of 10 micromolar was added to blood in a polypropylene tube, incubated for 30 minutes at 
37C in 5% CO2 . After 30 minutes a stimuli such as LPS at 10 ng/mL or heat killed 

staphlococcus (HKS) at 1:100 dilution was added to the drug treated whole blood. 
Incubation continued at 37°C in 5% CO2 for 6 hours unless otherwise indicated. 

Erythrocytes were lysed in RBC lysis solution (Ambion) and remaining cells were lysed 
20 according to the Ambion RN Aqueous-Blood module (catalog #1913). RNA was eluted in 
. Ambion elution solution. RNA was DNAsed treated with 1 unit of DNAse I (Ambion 
#2222) in IX DNAse buffer at 37°C for 30 minutes. In this example, first strand synthesis 
was performed using the Applied Biosystems TaqMan Reverse Transcriptase kit with 
MultiScribe reverse transcriptase (catalog # N808-0234). Quality check of RT reactions 
25 were performed with Taqman PCR chemistry using the 18S rRNA pre-developed assay 
reagents (PDAR) from Applied Biosystems (part #4310893E). PCR assay of Source 
Selected Profiles were performed on 6 to 24 genes in four replicates on the Applied 
Biosystems 7700. PCR assays were performed according to specifications outlined with the 
PDAR product. Relative quantitation of the gene of interest was calibrated against 18S 
30 rRNA expression as described in Applied Biosystems product User Bulletin 2 (1997) and 
elaborated in Hirayama, et al (Blood 92, 1998:46-52) using 18S instead of GAPDH. 
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Relative quantitation of the mRNA was measured by the difference in threshold 
cycles between 18S and the gene of interest. This delta C T was then compared to the 

normalizing condition, either subject before treatment, or stimuli without drug in an ex-vivo 
assay to measure "fold induction" represented in the bar graphs (Figure 14). For example, in 
5 the above graph, IFN- levels are 1/50 less on day 3 than before treatment. 

Example 6 . In Vivo and Ex vivo samples provide comparable signature profiles. 
Figure 15 shows the calibrated profile data set for two subjects (Subject 1 and Subject 
2) who have been treated over a three day period with a standard dose of the corticosteroids, 
dexamethasone. Blood from each subjects was obtained 72 hours later and a quantitative 
10 measure of the amount of RNA corresponding to the panel constituents was determined. 

Although, the calibrated profile data set for each subject was similar for most gene loci, some 
notable differences were also detected, for example for 11-2, 11-10, H-6 and GM-CSF. A 
calibrated profile data set is also shown for comparison for an ex vivo sample of blood from 
sample 1 prior to treatment with corticosteroid where the ex vivo sample is subjected to an 
15 equivalent amount of corticosteroid in vitro as calculated to be the plasma level in the 

subject. The similarity in the calibrated profile data set for ex vivo samples when compared 
to in vivo samples provides support for an in vitro assay that will predict the in vivo action of 
the compound. We have observed a similar comparable effect between in vivo and ex vivo 
samples infected with an infectious agent, more particularly bacterial or viral agents. We 
20 have concluded therefore that the ex vivo samples provide an effective method of 

determining the effect of a single compound or multiple compounds on a patient, where the 
multiple compounds may be either used in combination, in parallel or sequentially to 
optimize the selection of an agent for a biological condition for the subject. 

Example 7 . Demonstration of reproducibility of an in vitro response with an 
25 approved anti-inflammatory on 5 different donor subjects. 

Comparison and analysis of the Figures 18a through 18e demonstrate the consistency 
of effect of the stimulus and in vitro treatment with an approved anti-inflammatory on 5 
different donors (each figure representing a unique donor). The use of a known and tested 
stimulus results in a highly reproducible gene response in vitro that may be correlated with a 
30 predictable in vivo response. 

Figures 18a-18e provide the results of analysis of 5 donors from which a blood 
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sample has been taken. The blood samples were exposed to a therapeutic agent at various 
concentrations ranging from 0.1|iM to 5|lM, more particularly O.IjliM, 03\iM, lpM, 3|nM 
and 5|iM, for a 4 hour period. Different concentrations of the drug resulted in a calibrated 
profile data set for an inflammation panel at each concentration that was qualitatively 
5 different from the next. Figure 18a corresponds to donor 1, Figure 18b corresponds to donor 
2, Figure 18c corresponds to donor 3, Figure 18d corresponds to donor 4, and Figure 18e 
corresponds to donor 5. Each individual varied from the other and also provided a variable 
profile for a different concentration. This set of figures illustrates the high level of 
information obtainable by calibrated profile data sets. 

10 Example 8 . A calibrated profile data set may provide a signature profile for a 

complex mixture of compounds. 

Figure 21 illustrates the effect of three different anti-inflammatory herbs on a selected 
panel of constituents including constituents of an Inflammation Selected Panel (TNF-a, II- lb, 
ICAM, 11-8, H-10, Il-12p40, ICE, cox-2, cox-1 and mmp-3) a cell growth and differentiation 

15 selected panel (c-fos, c-jun and STAT3) , a toxicity selected panel (SOD-1, TACE, GR, 

HSP70, GST, c-fos, c-jun, INOS) and a liver metabolism selected panel (INOS, cyp-a and u- 
pa). The cells assayed in Figure 21 are aliquots of blood from a subject that are exposed ex 
vivo to lipopolysaccharide and to Echinacea (SPM9910214) Arnica (SPM99 10076) and 
Siberian Ginseng (SPM99 10074), each of the nutraceuticals being applied to the blood 

20 sample at the same concentration of 200ug/mL. The baseline is cell sample with 

lipopolysaccharide in the absence of a nutraceutical. Each nutraceutical (formed from a 
complex mixture) has a characteristic signature profile just as did the single compound 
pharmaceutical anti-inflammatory agents. The signature profile may be provided in a 
graphic form that can be use to identify a herbal while providing information concerning its 

25 properties and its efficacy for a single subject or for an average population of subjects. 

Example 9 . A quality control assay for Echinacea brands using calibrated profile 
data sets. 

Figure 24 shows a graphic representation of the calibrated profile data sets for four 
different commercial brands of Echinacea. Brands using an Inflammation Selected Panel. 
30 As expected, SPM007 and SPM003 gave the signature, calibrated profiles similar to 
authentic Echinacea. Samples SPM010 and SPM 016, although labeled and sold as 
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Echinacea when tested using the system described in Figure 14, resulted in signature 
calibrated profiles that were substantially similar to the profile obtained with 
lipopolysaccharide alone. Echinacea samples SPM010 and SPM016 were found to have 
elevated, highly biologically active levels of endotoxin while the LPS levels in SP007 and 
5 SP003 were undetectable. A stored signature profile for active Echinacea obtained from a 
selected panel designed to test efficacy and mode of action, e.g., the inflammation panel, 
permits evaluation of new batches of Echinacea, differentiation of existing or new brands of 
Echinacea, guide the isolation and development of new compounds with different or similar 
activities from a complex compound like Echinacea or may be used in the development of 

10 quality assurance in the production, analysis and sale of new or previously marketed 

compounds. In the example cited, two of the brands of Echinacea SP010 and SP016.result in 
calibrated profiles that are characteristic of authentic Echinacea. 

Example 10 . Comparison of three herbal preparations using an indicator cell line. 
Figures 25 (a) through 25(c) provide calibrated profile data sets for three herbal 

15 preparations with respect to an indicator cell line (THP-1) rather than a blood sample from a 
subject. In Figure 25(a), the baseline is the profile data set for THP-1 cells absent the herbal 
while the histograms represent the calibrated profile data sets for the same herbal from three 
different manufacturing sources of the same herb at 250ug/ml. Gene expression results are 
shown on a log scale. Similar to the observation in Figure 14, these demonstrate that 

20 similarly labeled compounds obtained from different sources have demonstrable and 

quantifiable differences in calibrated profiles using a specific panel, e.g., the inflammation 
selected panel designed to obtain information about the expression of gene products related 
to inflammation and infection. This suggests that the compounds likely have different 
efficacies when used for specific purposes. 

25 Figure 25(b) provides a comparison of the calibrated profile of a single herb at three 

concentrations using the indicator cell line of THP-1. The baseline profile data set is 
untreated THP-1 cells. Analysis of the data suggests a concentration-dependent response in 
the indicator cell lines which, although demonstrated here, may be indicative of a similar 
response in subjects. 

30 Figure 25(c) provides a comparison of four commercial Echinacea brands used at the 

same concentration and tested against a panel of constituents using a THP-1 cell line as an 



2331/130 



indicator cell population. Differential expression, as revealed, for example, by inspection or 
calculation of differences in the calibrated profiles, allows direct comparisons of complex 
compounds to be made. For example, analysis of the differences in the calibrated profiles 
may be used to guide compound isolation and development, product differentiation in the 
5 marketplace, or used by the consumer or health professional to guide the individualized 
choice of a single compound from a class of similar compounds that may be suited for a 
particular biological condition. 

Example 11 . Set up of a 24-gene Human Selected Panel for Inflammation. 
Materials 

10 1 20X Primer/Probe Mix for each gene of interest. 

2 20X Primer/Probe Mix for 18S endogenous control. 

3 2X Taqman Universal PCR Master Mix 

4 cDNA transcribed from RNA extracted from cells 

5 Applied Biosystems 96-Well Optical Reaction Plates 
15 6 Applied Biosystems Optical Caps, or optical-clear film 

7 AB Prisma 7700 Sequence Detector 
Methods 

1 Make stocks of each Primer/Probe mix containing the Primer/Probe for the 
gene of interest, Primer/Probe for 18S endogenous control, and 2X PCR Master Mix as 

20 follows. Make sufficient excess to allow for pipetting error e.g. approximately 10% excess. 
The following example illustrates a typical set up for one gene with quadruplicate samples 
testing two conditions (2 plates). 

1X( 1 well) 9X (2 plates worth) 
2X Master Mix 12.50 112.50 

25 20X 1 8S Primer/Probe Mix 1 .25 11 .25 

20X Gene of interest Primer/Probe Mix 1 .25 1 1 .25 

Total 15.00 135.00 

2 Make stocks of cDNA targets by diluting 95^1 of cDNA into 2000|il 
30 of water. The amount of cDNA to be adjusted to give Ct values 

between 10 and 18. 
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3 Pipette 15|jl of Primer/Probe mix into the appropriate wells of a Applied 
Biosystems 96-Well Optical Reaction Plate. 

4 Pipette 10|il of cDNA stock solution into each well of the Applied Biosystems 
96-Well Optical Reaction Plate. 

5 5 Seal the plate with Applied Biosystems Optical Caps, or optical-clear film. 

6 Analyze the plate on the AB Prism 7700 Sequence Detector. 
In Examples 12 through 19 below, procedures analogous to those of Examples 1 and 
1 lwere followed to determine relative mRNA expression. 

Example 12 . Calibrated profile data sets, using a subset of the Inflammation Selected 
10 Panel, show the effect of administration of a steroid. 

In Figures 26(a) and 26(c), subjects 1 and 2 respectively have been subjected to a 
course of administration of methylprednisone twice a day for three consecutive days. In 
Figures 26(b) and 26(d), two other individuals, identified as control 1 and control 2 
respectively, were administered a placebo over a corresponding period. In each case, the 
15 blood samples of the individuals were taken prior to the course of administration and 

immediately following the course of administration. The graphs of these each of these figures 
show the relative concentration of each constituent of a subset of the Inflammation Selected 
Panel, the subset being chosen for its ability to discriminate as to the effect of anti- 
inflammatory agents. For each constituent, the post-administration concentration is shown as 
20 a ratio in relation to its pre-administration concentration; hence the baseline of 1 is indicative 
of the same concentration of the constituent before and after administration, and the presence 
of a bar below the baseline indicates a post-administration drop in concentration of the 
constituent. Figures 26(a) and 26(c) show that the response of subjects 1 and 2 to the 
administration of the steroid are qualitatively and quantitatively similar — indeed, strikingly 
25 similar. Moreover, and in contrast, the responses of controls 1 and 2 to the placebo are 
markedly distinct from the responses of subjects 1 and 2. 

Example 13 . Calibrated profile data sets, using a subset of the Inflammation Selected 
Panel, provide a comparison of the effects of administration of methylprednisone and 
Ibuprofen. In this example, Figures 27(a) and 27(c) are identical to Figures 26(a) and 26(c), 
30 and show the responses of subjects 1 and 2 respectively to the administration of 

methylprednisone. Figures 27(b) and 27(d) show the responses of the same subjects, namely 
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subjects 1 and 2 respectively, to the administration of high-dose Ibuprofen (800 mg 
administered three times per day over a three-day period). (Figures 27(b) and 27(d) use the 
same conventions as Figures 27(a) and 27(c) in showing post-administration concentration of 
constituents relative to pre-administration concentration.) It can be seen from these figures 
5 that the responses of the subjects to Ibuprofen are qualitatively and quantitatively similar — 
again, strikingly similar. Moreover, the responses of the subjects to Ibuprofen are distinct 
from the responses of the subjects to methylprednisone. In fact, the distinct pattern of 
response to Ibuprofen correlates with other information known about Ibuprofen. For 
example, IL-l-p is known to be of importance in responding to joint destruction, and 

10 Ibuprofen here is shown as not raising the level (and in the case of subject 2, lowering the 
level) of IL-l-P expression. And Ibuprofen is known to be not very effective in treating joint 
destruction. Similarly, IL-10 activity is associated with anti -inflammatory activity and is 
useful in addressing bowel inflammatory disease; Ibuprofen here is shown as in fact 
depressing the level of IL-10 expression. These phenomena are consistent with the fact that 

15 Ibuprofen is known to be ineffective in addressing bowel inflammatory disease. 

Example 14 . Calibrated profile data sets, using a subset of the Inflammation Selected 
Panel, identify chronic obstructive pulmonary disease (COPD) patients. 

The graphs of these each of Figures 28(a) through 28(d) show the relative 
concentration of each constituent of a subset of the Inflammation Selected Panel, the subset 

20 being chosen for its ability to discriminate as to the presence of COPD. For each constituent, 
the concentration is shown as a ratio in relation to a concentration that is normative of the 
concentrations of the constituent in a population of healthy subjects. Thus for any 
constituent, a baseline level of 1 corresponds to a normal concentration. Figures 28(a) and 
28(c) show the relative concentrations of constituents in COPD patients 1 and 2 respectively, 

25 while Figures 28(b) and 28(d) show the relative concentrations of constituents in two healthy 
individuals identified as control 1 and control 2 respectively. Indeed, Figures 28(b) and 
28(d) show levels of constituents in controls 1 and 2 as being close to population normals, 
whereas Figures 28(a) and 28(c) show levels of constituents in COPD patients 1 and 2 as 
being dramatically different from normal levels. 

30 Example 15 . Evaluations, of the effects of drug exposure performed in vitro 

correspond closely with evaluations performed in vivo, employing in each case calibrated 
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profile data sets, using a subset of the Inflammation Selected Panel. 

Figures 29(a) and 29(b) present graphs showing response to the administration of 
methylprednisone. The response shows the relative concentration of each constituent of a 
subset of the Inflammation Selected Panel, the subset being chosen for its ability to 
5 discriminate as to the effect of anti-inflammatory agents. For each constituent illustrated, the 
darker bar (on the right) shows the in vivo response of a subject to a course of administration 
of methylprednisone twice a day for three consecutive days. In the in vivo cases, a blood 
sample of the subject was taken prior to the course of administration and immediately 
following the course of administration, the constituents were measured, and the responses are 

10 shown in Figures 29(a) and 29(b). The post-administration concentration of each constituent 
is shown as a ratio in relation to its pre-administration concentration; hence the baseline of 1 
is indicative of the same concentration post-administration as pre-administration. The 
procedures of Figures 29(a) and 29(b) were conducted on two different occasions a year 
apart. At the same time that each in vivo procedure was conducted, methylprednisone was 

15 also administered in vitro to a sample of the blood of the same subject. For each constituent 
illustrated in each of Figures 29(a) and 29(b), the lighter bar (on the left) shows the in vitro 
response of the sample to the administration of the drug. Again, the post-administration 
concentration of each constituent is shown as a ratio in relation to its pre-administration 
concentration. 

20 What is remarkable about the results shown in Figures 29(a) and 29(b) is that in each 

procedure, the in vitro response is strikingly similar to the in vivo response, in most cases 
even where the results in vivo differed over time. This result shows the value of in vitro 
modeling for the evaluation of the effect of the administration of agents using Selected 
Panels. 

25 Example 16 . The effect of different agents is evaluated using a subset of the Selected 

Prostate Panel. 

In Figure 30 is shown the response of five different cell lines to the administration of 
various agents, using a subset of the Selected Prostate Panel (listed in Table 5). This figure 
also shows broad functions of constituents of the panel. 
30 Example 17 . The use of a rat liver metabolism selected panel to measure the effect of 

a pharmaceutical agent, clofibrate, Male rats were treated with 400mg/kg/day of clofibrate 
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administered by mouth and the levels of gene expression were measured in liver tissue. 
Clofibrate is used here because its metabolism in the rat and human liver is well described. 
As expected, clofibrate induces gene expression at the cyp 1 Al locus, but the agent also 
induces expression at a number of other metabolic loci in the selected panel as measured in 
5 this cohort of in-bred Spraque-Dawley rats. The ratio of the concentration of each constituent 
for the clofibrate treated rats is measured with reference to a control (baseline) which is a set 
of rats treated only with the carrier compound. The resultant selected profile is provided in 
Figure 31. 

Example 18 . The ability of the rat metabolism selected panel to differentiate drug 
10 responses 

Male rats were treated with 400mg/kg/day of clofibrate or benzo[a]pyrene 
administered by mouth and the levels of gene expression were measured in liver tissue. The 
response to clofibrate and benzo[a]pyrene was determined in Spraque-Dawley rats using a 
rat metabolism panel. The results are shown in Figure 32. Each drug gives a characteristic 

15 and distinct pattern of gene induction across the selected panel. As expected from the prior 
art, benzo[a]pyrene specifically induces gene expression at the loci for cyp 4A1 and HD. The 
control (baseline) is a set of rats treated only with the carrier control.. 

Example 19 . The effect of administration of a stimulant is measured by a skin- 
epithelial/vascular/inflammation selected panel. 

20 Figure 33 illustrates the response of a subject to the administration of a stimulant 

(TNF-alpha, 10 ng/ml), as measured by a skin-epithelial, vascular/inflammation selected 
panel. In this example, the selected panel is created from constituents that are also found in 
other panels which have been here selected for purposes of best establishing an effect 
resulting from the stimulant. 

25 Example 20. Use of a human liver selected panel for determining the metabolic 

properties of cryopreserved human hepatocytes . 

Time/dose response experiments utilizing compounds with well-described toxicities 
and mechanisms of action are an early step in the biological validation of the selected panel. 
Figure 34 shows the gene expression profile resulting from a timed study at constant dose of 

30 benzo(a)pyrene when administered to cryopreserved human hepatocytes. 

Example 21. Response of human umbilical vein endothelial cells to TNF-a. 
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Figure 35 illustrates how endothelial cells respond to the inflammatory TNFa by the 
induction of expression of a number of gene loci, notably the adhesion molecules ELAM, 
ICAM, and VCAM. The cells were exposed for 24 hours. Both 5 ng/ml and 10 ng/ml are 
high doses of the immunostimulant TNFa and , as expected, no clear concentration response 
5 is observed at this dose level. 

Example 22. This example is one illustration of the wide embodiments of the selected panels. 
In this example the effect of a compound in solution (NAC) is compared directly to the effect 
of an environmental stimulus (UVB) and the combined effect is read out as differential gene 
expression. Figure 19 illustrates an similar effect of NAC only in blood obtained smokers 
10 and non-smokers. Dose and time experiments were conducted prior to this illustrated 
experiment. 

Example 23. Gene expression profiles provide information on the effect of an 
environmental stimulus on cells. 

This example is one illustration of the wide embodiments of the selected panels. In. 

15 this example (see Figure 36) the effect of a compound in solution (N-acetylcysteine) is 

compared directly to the effect of an environmental stimulus (UVB) and the combined effect 
is read out as differential gene expression. Figure 19 illustrates an similar effect of NAC only 
in blood obtained smokers and non-smokers. Dose and time experiments were conducted 
prior to this illustrated experiment. 

20 Example 24. Demonstration of reproducibility of an in vitro response with an approved 
anti-inflammatory on different donor subjects. 

The consistency of effect of the stimulus and in vitro treatment with an approved anti- 
inflammatory on different donors is performed by taking blood samples, as described above 
in Example 1, and preparing aliquots for baseline, non-stimulus, and stimulus with sufficient 

25 volume for at least three time points. Typical stimuli include ibuprofen, acetaminophen, 

aspirin, prednilisone, prednisone or dexamethasone and may be used individually (typically) 
or in combination. The blood samples are exposed to an anti-inflammatory therapeutic agent 
at various concentrations ranging from O.ljiM to 5(jM, more particularly 0.1|iM, 0.3|jM, 
l(iM, 3(iM and 5|jM, for a 4 hour period. Different concentrations of the drug will provide a 

30 different profile data set for an inflammation panel, comprised of two or more genes from 
Tables 1-8, at each concentration that is qualitatively different from the next. At defined 
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times, cells are collected by centrifugation, the plasma removed and RNA extracted by 
various standard means, amplified and quantitated, as described in Example 1 above. From 
the quantitative measurements of amplified RNA, profile data sets for the selected panel are 
derived for each aliquot. 

5 Each resulting profile data set is descriptive of the biological condition of the sample 

at the time and circumstances at which the sample is taken, and therefore descriptive of the 
biological condition of the donor from which the sample is taken. 

Each sample from the individual donors varies from the other and also provides a 
variable profile for a different concentration. Such a technique allows a high level of 
10 information obtainable by profile data sets. 

Example 25. Different inflammatory stimuli give rise to different profile data sets so that 
the selected profiles for different agents in the same class of anti-inflammatory result in 
different signature profiles. 

15 Different inflammatory stimuli give rise to different profile data sets that may be used 

in determining the selected profile data sets for whichever anti-inflammatory agents are 
tested, and the resulting different signature profiles. The different profiles reflect the 
difference in the molecular targets and mechanisms of action of the different agents derived 
from a single class of therapeutics, i.e., anti-inflammatory agents. An extraordinary range of 

20 detection, from less than 10-fold difference from the profile with respect to some constituents 
to a change of 10 13 (10E13) in gene expression of one constituent is not unprecedented, 
when compared to the baseline profile data set (no stimulus). Comparison to the baseline 
results in gene expression profiles that are increased, decreased, or without change from the 
baseline set. 

25 Such large and small effects can be highly significant in disease etiologies and 

outcomes, and have predictive value concerning the usefulness as therapeutic agents of 
specific compounds, or those with similar chemical entities, or those chemicals that act 
similarly. 

Example 26.Gene expression profiles provide information on the effect of an environmental 
30 stimulus on cells. 

As an illustration of the wide embodiments of using a selected panels, the effect of a 
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compound in solution is compared directly to the effect of an environmental stimulus (UVB) 
and the combined effect is read out as differential gene expression. A similar effect of a 
compound may be observed in blood obtained from smokers and non-smokers. Dose and 
time experiments are conducted prior to this type of experiment, to optimize concentrations 

5 of compounds to be investigated, or to define useful exposure conditions for the 

environmental stimulus. Such a method allows profile data sets to be obtained from samples 
from individuals, before exposure and/or treatment, after exposure and/or treatment, and 
provides a description of the biological condition of the sample at the time and circumstance 
it is obtained, which in turn provides a description of the biological condition of the 

10 individual donor, both before and after treatment and/or exposure. 
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Table 1. Inflammation Selected Panel 



IL-la 


Interleukin-1 alpha 


IL-lp 


Interleukin-lBeta 


IL-2 


Interleukin-2 


IL-4 


Interleukin-4 


IL-6 


Interleukin-6 


IL-7 


Interleukin-7 


EL-8 


Interleukin-8 


IL-10 


Interleukin-10 


IL-12p40 


Interleukin-1 2p40 


IL-15 


Interleukin-1 5 


IL-18 


Interleukin-1 8 


GM-CSF 


Granulocyte colony stimulating factor 


IFN-y 


Interferon gamma 


TGF-a 


Tumor growth factor alpha 


TNF-a 


Tumor necrosis factor alpha 


TNF-p 


Tumor necrosis factor beta 


Cox 2 


Cyclooxygenase / prostaglandin-endoperoxide 
synthase 2 


ICE [ 


Interleukin-1 converting enzyme 


c-jun 


MKK7, MAP2K7 


mmp9 


Matrix metalloproteinase 


UPA 


Urokinase plasminogen activator 


HSP70 


Heat Shock Protein 70 kDa 


CRE 


cAMP Response Element 


ICAM 


Intercellular Adhesion Molecule 
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Table 2. Cell Growth and Differentiation Selected Panel 



BIKC5 (burvivm) 


Apoptosis Inhibitor 


JNrlsjDl 


NF-kappaB 


Pni/MO A /D 1 £\ 

CUlsJNzA (rlu) 


Cell cycle inhibitor 


lr33 (F53) 


Tumor suppressor 


TNr-a 


Tumor Necrosis Factor alpha 




Telomerase Catalytic Subunit 


BCL2 


Represses Apoptosis , 


O A V 

BAX 


Promotes Apoptosis , 


CAbPl (lCb) 


Interleukin Converting Enzyme 


/-» a r\T\ A C A 

CjADD45A 


Growth arrest protein 


IJNrRorllA (RANK) 


Receptor activator of NFkB 


PDCD8 (A1F) 


Apoptosis Inducing Factor 


A ~ J? 1 

Apaf-1 


Apoptotic protease activating factor 1 


DFFB (DFF40) 


Caspase activated DNAse 


T"> A T A T»0 /T A "T> 1 "P> TT> O \ 

BAIAP3 (IAP1, BIRC3) 


Inhibitor of apoptosis protein 1 (B Al-associated protein 3) 


BIRC2 (IAP2) ~ 


Inhibitor of apoptosis protein 2 


T> * 1 

BlK 


BCL2 interacting killer 


BCL2L1 (BCL-X) 


BCL2-Like 1 


DADl 


Defender against cell death 1 


x at a r\r\ 

MADD 


MAP Kinase activating death domain 


MAP3K14 


Mitogen-activated protein kinase 3, 14 


TV IM 'XT 

PTEN 


Protein tyrosine phosphatase 


k-alpha-1 


Alpha tubulin (housekeeping, high abundance) 


TACA 


Anti-fas induced apoptosis 


CCIKZ 


Cyclin dependent kinase 2 


cdk4 


Cyclin dependent kinase 4 


CASP 3 


Apoptosis-related cysteine protease, 3 


CASP 9 


Apoptosis-related cysteine protease, 9 


RAD52 


DNA ds break repair 


XRCC5 (Ku80) 


X-ray repair complementing defective repair in Chinese 




hamster cells 5 


PiNKr 


Polynucleotide kinase phosphatase 


Mlvbl 1A 


Meiotic recombination 1 1 homolog A 


CCJNL)! (cyclm Dl) 


PRAD1: parathyroid adenomatosis 1 


CCIND3 (cyclin U3) 


Cyclin D3 


CLiNiil (cyclin 


Cyclin h 


CAJNAZ (cyclin A) 


Cyclin A 


cciNr>i (cyciinuj 


cyciin d 


CDKN2B (pl5) 


Cyclin-dependent kinase inhibitor 2B (pi 5, inhibits CDK4) 


CDKNlA(p21) 


Cyclin-dependent kinase inhibitor 1 A (p21, Cipl) 


RBI 


Retinoblastoma 


BID 


BH3 interacting domain death agonist 


BAK1 


BCL2-antagonist/killer 1 
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BAD 


BCL2-antagonist of cell death 


SMAC [ 


Second mitochondria-derived activator of caspase 


VDAC1 


Voltage-dependent anion channel 1 


CHEK1 


Checkpoint, S.pombe, homolog of, 1 


ABL1 


11 a 1 1 • 1 1 *'1 L 1 i 

v-abl Abelson murine leukemia viral oncogene homolog 1 


TNFRSF12 


Tumor necrosis factor receptor superfamily, member 12 


TRADD 


TNFRSF1 A-associated via death domain 


FADD 


Fas (TNFRSF6) - associated via death domain 


TRAF1 


TNFR associated factor 1 


TRAF2 


TNFR associated factor 2 
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Table 3. Human Liver Metabolism and Toxicity Selected Panel (See Figure 34) 



CYP1A2 


Polycyclic aromatic hydrocarbon (PAH) 
metabolism; induced by smoking 
Catalyzes formation of toxic APAP 
metabolite 


CYP2A6 


Catalyzes oxidation in some 

pharmaceuticals, procarcinogens, and 

smoke constituents; upregulated in vitro 

after exposure to barbiturates or dex. ' 


CYP2E1 


Converts many small organic 
Compounds (i.e. EtOH, APAP, CC14) 

■ a * " 1 * * T 1 1 • "111* 1" 

into reactive intermediates. Induced in alcoholics and in 
fatty liver phenotype. 


CYP2D6 


Broad catalytic activities for over 30 
therapeutic drugs. 


CYP3A4 


Metabolism for a wide variety of drug types 


UGT2B7 


UDP glycosyltransferase 2B7 


UGT2B15 


UDP glycosyltransferase 2B15 


EPHX1 


Microsomal epoxide hydrolase 
Multiple tissue-specific splicing variants 


GSTA1 


Glutathione S-transferase alpha 1 


GSTA2 


Glutathione S-transferase alpha 2 


UCP-2 


Mitochondrial uncoupling protein 2 


TNF-a 


Local inflammation 

Endothelial activation 

Released by PBMCs, Kupffer cells and 
activated tissue macrophages in the liver 


TGF-p 


Transforming growth factor beta 1 


iNOS 


Inducible nitric oxide synthase; 


SCF 


Stem cell factor, released by activated 
hepatic stellate cells (HSC*); recruits 
mast cells (MC) to the liver 


IFN-y 


Activation of macrophages 


Galectin-3 


p-galactoside-binding lectin associated 
with cell growth, tumor transformation, 
and metastasis. 


FAP 


Fibroblast activation protein; membrane 
protease expressed at sites of tissue 
remodeling. 
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Procollagen C-proteinase 
(akaBMPl) 


Extracellular matrix protein 
(bone morphogenetic protein 1) 
required for cartilage formation 


Collagen I 


Extracellular matrix (ECM) component 


Collagen m 


ECM component 


Collagen IV 


ECM component 


Laminin 


ECM component 


Fibronectin 


ECM component 
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Table 4. Skin Response Selected Panel 



fD APP9 


i^ciiuicu icmiuic ddu-uiiiuing pruicin z> 


JSJxl l*f 


JvcraLlIl 1*t 


JVTv 1 J 


rvciallll J 


KRT16 


T^praHn 1 6 

XVl/1 dLlll J. \J 


FGF7 

J. V-Jl / 


Fihrnhl^Qt crrnwtTi faptor fTCGF 1 TCpraHnnpvtp orowtfi f<\c\csv 

jl iuiuuiuji gi u w ui lclk/xxji \ jlvv_j i y i\vi citJ.iiwv/ y iv»/ u w hi i civ^ iui 


FN1 


FiHrnnpptin 1 


IVL 


Involucrin 


PfiT 7 A 1 


iype vii collagen, dipnd i 


CTGF 


GrvnnppHvp tiQ^np ornwth fpptor 


TT -If/ 


TntprlpiilHri- 1 r/ 


TT -8 


111 IC11CU.KJ.I1 O 


VJI\.v_/l 


ivicidiiuiiid giuwiii oiiiiiuicxLUi y dcuviiy ^ivivjo/a.^ 


PTG<s9 


r lUaldgldllUlll-CllUOpClUAlUC 5>yilllldk>C ^V_V_/y\.Z. ^ 
v^yCIULJAjrgClldoC Z, 




Tiitnnr nprrnQic fiiPtAr JilnriJi 
1 UIHUI iiccxuaJo laciui dipiid 


TV*" 1 !? R 1 


lransiornung growin iacior ucia i 




Proteinase inhibitor 3 


rJJ 


(SKALP) Skin-derived antileukoproteinase 


Den 


rsasignin ^JDmivirKiiN ) xixtraceiiuiar iviiYLr lnuucer 


MMri 


Matrix metalloproteinase 1 (interstitial collagenase) 


MiVlrZ 


Matrix metalloproteinase 2 (72kD gelatinase) 


MMrj 


Matrix Metalloproteinase 3 (Stromelysin 1) 


TTA/tI> 1 
1 IMrl 


Tissue inhibitor of matrix metalloproteinase 


tnV/TfiY 1 


rieme oxygenase i 


CI a T^TVI ^ A 
LrAUJJ4DA 


Growth arrest and DNA- damage-inducible alpha 


PPM A 


r roiiieraung ceil nuciear antigen 


TYTTQP1 
JJUor 1 


uuai specmciiy pnospnatase^Liuuj 


MAPK8 


Mitogen activated protein kinase 8 




Tumor protein p53 (p53) 


rsciz 


r>-ceii ^.LL/iympnoma z 


Bax 


Bcl2-associated X protein 


TT TM 


c-jun 


FOS 


c-fos 


NR1I2 


Nuclear receptor subfamily 1, group I, member 2 (PAR2) 
Protease activated receptor 2 


S100A7 


S100 calcium-binding protein A7 (PSOR1) psoriasin 1 


TNSF6 


Tumor necrosis factor (ligand) superfamily, member 6 
(FASL) Fas ligand 
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Table 5. Prostate Selected Panel (See Figure 30) 



PC A 


riosiaie opecnic /vnugen 


DL/J 


r rosiaie cancer aiuigen j 


ourvivin 


/\popiosis inniDiiur s- 




r rosiaie opeciiic xvieniDrane /vniigen 




JTUldlC xT-jrillWldoC 1 


TFRT 

1 JJ/lV 1 


IClUillCld^C JXCVC15>C 1 ralldCripLaSC 




TplomprpQp Gntalvtip Sliihiinit 


l\L/lViL 


rTiimnn T^nlliVrpin 9 
Human rvdiiijvi ciii l* 


PDFF 


Pmctaf'P-T'iprivpfT Ftc Fjiptnr 

X iUoldlC "lyCllVCLl X_/l3 JTalAUl 


psga 


Prnetntp ^Itpin GpII Antitrpn 

x lUMcllC OIC111 V^CII ^YlltlgCll 


POV1 

I W V 1 


Prnctfitp Gnnppr OvprPYnrpccpH Gptip 1 

X IvJMaLC V_.allCCl W VClCApiCMCLl \JC11C 1 


PART-1 


Prnetiitp A nHmcrpn-P pcrnl ntp/T r rr5»ncr , T*int 1 
x lUoldlC /\IlUIUgCIl IxC^UlalCU. llallovllUl 1 


1Y1 X v> 


L> llljfVv 


NRP1 


1NCU1UU1111111 1 


KAT1 


xlUIIlall lvlCLa»la!Sl!S OUppiCSaUI VJCI1C 


T GAT SR 


vJdlCClllI O 


pio 


V^ycllIl'l-'CpCIlUCIH JNallaoC ^/A. 


vJO 111 


vJlUlaLIUUXlC~0" 1 IdllMCl d^C UlCld 1 


PAT1 

x /\11 


x idMiiinugcn /\ciivaiur iniiiDiiur i 




"R-rpll GT T / T vrrmhnmQ 9 
JD LCli V^JU-Lv / x-#yiiipiiuiiid Z» 


OInlJ 


T'r5iTicr , ri'ntir\Ti5i1 nr*ti\/£itr*r 
lldllddipilUlldi dCLlvdlUi 




IIIIC I ICUKIII O 


u-pa 


\J I UAlIlaoC- 1 y pC x IdMIllIlUgCIl /\V»/LlVdLUI 


JVlX 1 -j 


JVCldllll J 


TGF-R 
1 vji p 


Tron cfr^rmi cr G'rrwx/tTi Fnr*tnr Rpt^ 
1 1 aiidiiii iiuiig vjiuwLii ra\si\jL JDvid 


TT -R 


H11C11CUJSJ.II o 


VFGF 


^/jicpnljir FnHr^tfipl in G-t*n\x/th F^ptor 

Y doV^Uldl l^llLiVJLllClld VJ1L/WL11 1 dCLvJl 


agpp 


^VCIVI UllUoUllaldoC, piuoiaic 


KRT-19 


Ivvl all 1 1 1 Z7 


CK-8 


Gvtnk"pratin R 


Ma^nin 

1YAU.O LM11 


Prntpa<ip Tn hi Hi tor S 

X 1 V-J IL/AjV XI 1111 L/l tv/1 


HMG-I/Y 


r^on-Tii^tnnp pVirnmn^nmjil nrnfpin 

UltJtWllV^ ^111 W 111VJ k5V_7ll Idl UlUlvlll 


TGFR1 

XVJX XV X 


TrKiilin Growth F^rtor Rpppntor 1 

niouiiii vji yj w uii x civ^ ivji L\.\s^r w yj wjx x 


HUPAP 

X X KJ X ill 


TTnman Pro^tntp-A^QOpintpH Protpjwp 

llUlllclll X luolul^/ iiooU^l clL^VJ X lUlVdj^ 


P53 


Tumor Qiinnrp^^or 

X LllllWl O LI UU1 tooUl 


COX-2 


GvpIooy vtrpn^QP 9 
V-/ y tiuuA y gwiiciOv/ 


E-CAD 


e-cadherin 


N-CAD 


n-cadherin 


CTNNA1 


a-1 catenin 


PCANAP7 


Prostate cancer associated gene 7 


MRP1 


Multiple Drug Resistance Protein 1 


HSP-70 


Heat shock protein 


TNF-a 


Tumor Necrosis Factor 
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Table 6 Vascular Selected Panel 



VEGF 


Vascular Endothelial Growth Factor 


NF kappa B 


Nuclear Factor kappa B 


TEK/TIE2 


Tyrosine kinase, endothelial 


ERK2 


MAPK1: mitogen-activated protein kinase 1 


SELE 


selectin E (endothelial adhesion molecule 1) 


Flt-1 


fms-related tyrosine kinase 1 (vascular endothelial 
growth factor/vascular permeability factor receptor) 


PTX3 


pentaxin-related gene, rapidly induced by EL-1 beta 


HMOX-1 


HMOX1 = heme oxygenase (decy cling) 1 


HIF-1 


Hvnoxia-inducible factor 1 aloha subunit 


GRD1 


Glutathione Reductase 1 


iNOS 


_U.1UU.W1L/1W 111 LI 1W VJvYlVJ.W O J 11 111 Clow 


ET-1 


Endothelin 1 


FCF-1 


PfiHntViplin pnrivprtittQ" pnyvmp 1 

l/UUUlllvsllll C'VJll V \sL 1111 ii C'llZiVlllt A 


PT AT 


piaMlllllUgGU CtL/Ll VctliJi, liddUG 


ADAMTS 1 


a UlMlllCgllll -11A.C aim lllCLallupi VJLCaiC ^icpiuiyMii iypc^ 
with tnrnrnHfWTVYnHi n tvr>p 1 motif 1 

wiiii tin uuiuuo l/v/uu.ui iy l/v i muni, i 


PTGIS 


r>m<itJiP'1anHin T9 

1J1 WOlClglClllUlU JL^> 

(nro^tacvclin^ ^vntha^e 

i uiv/jiciv y vim / i3 y ii incio w 


COX-2 


prostaglandin-endoperoxide synthase 2 (prostaglandin 
G/H synthase and cyclooxygenase) 


VCAM 


vascular cell adhesion molecule 1 


IL-8 


Interleukin 8 


n-ip 


Interleukin-1 beta 


IGFBP3 


Insulin-like growth factor binding protein 3 


GJA1 


gap junction protein, alpha 1, 43kD 


ICAM-1 


intercellular adhesion molecule 1 
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Table 7. Rat Liver Metabolism and Toxicity Panel (See Figures 3 1 and 32) 



1 


ALDH 


2 


aldose reductase 


3 


ARG 


4 


CYP1A1 


5 


CYP1B1 


6 


CYP2A2 


7 


CYP2B2 


8 


CYP2C11 


9 


CYP2D2 


10 


CYP2E1 


11 


CYP2A1 


12 


CYP3A1 


13 


CYP1A2 


14 


CYP4A1 


15 


CYP4A3 


16 


CYP4F1 

v-' X X i X X 


17 


cvtochronie P450 oxidoreductase 

j V Will VillV X I V VimlVivl V\*UVVI*iJV 


18 


epoxide hydrolase 


19 


HD 


20 


MAO-B 


21 


quinone reductase 


22 


alpha- 1 -AGP 


23 


PPAR-alpha 


24 


GGT | 
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Table 8 - Master List for Gene Expression Panels 



Symbol 


Alias (es) 


Name 


Classification 


ABCC1 


Ivirvr, IvlJtvr I, Uj- 

X. ABCC. ABC29 


/\ i F-r>inuing v^dbScLie, 
Sub-Family C, Member 1 


i^ivcr riedim intiicdior 


ART 1 


APT ITVl n 
AdL, J 1 Iv/, C- 

ABL, pi 50 


v -aoi /\Deison iviunne 
Leukemia Viral Oncogene 
Homolog 1 


oncogene / enzyme . 
kinase 




DAD 

rAr 


Acid Phosphatase, Prostate 


Enzyme: phosphatase 


ACTB 


b-actin, Actin, 
cytoplasmic l,beta- 
cvtoskeletal actin 


Actin, Beta 


Cell Structure 


ADAM 17 


CSVP, TACE, 
TNF-a converting 
enzyme 


A Disintegrin and 
Metalloproteinase Domain 
17 


Metalloproteinase 


ADAMTS1 


METH1, C3-C5, 
KIAA1346 


A Disintegrin-Like and 
Metalloproteinase 
(Reprolvsin Type) with 
Thrombospondin Type 1 
Motif, 1 


Metalloproteinase 


AHR 


receptor, /\niv 


/\ryi nyurocdrDon ivecepior 


Kecepior / i ranscripuon 
Factor 


ALB 




rviDUinin 


j^iver nediin inuicdior 


ALOX5 


S - (5) 1 i no y v crpn a ^p 

5-@LO. 5-LO. 
LOG5 


ArarhiHnnatp S- 

Lipoxygenase 


Fn7vmp' 1 vr>nY vcrpnfiQp 


AMACR 


RACE 


Alpha-Methylacyl-CoA 
Racemase 


Proteinase: thiol 


ANXA11 


CAP-50, ANX11. 
Annexin XI, 56 kDa 
autoantigen 


Annexin All 


Phospholipid Binding 
Protein 
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APAFl 


CED4. KIAA0413 


Apoptotic Protease 
Activating Factor 1 


Proteinase Activator 


APOE 


Apo-E 


Apolipoprotein E 


Ligand 


ARG2 


E>C> 3.5.3.1 


Arginase 


Enzvme 


B7* 


B7H2, B7RP1, 
GL50, *Not a 
HUGO gene 
symbol* 


B7 Protein 


Cell Signaling and 
Activation 


BAD 


BCL2L8. BBC2. 
BBC6, 

binding protein 


BCL2 Agonist of Cell 
Death 


Membrane Protein / 
Apoptosis 


BAK1 


BAK, CDN1, 
BCL2L7, Cell death 
inhibitor 1 


BCL2-Antagonist / Killer 1 


Membrane Protein / 
Apoptosis 


BAX 


Apoptosis regulator 
Bax 


BCL2-Associated X Protein 


Membrane Protein / 
Apoptosis 


BCL2 


Apoptosis regulator 
Bcl-2 


B-Cell CLL / Lymphoma 2 


Membrane Protein / 
Apoptosis 


BCL2L1 


BCL-XL/S, 
BCL2L, BCLX, 
dLLaL, dLLao, 

Bcl-X 


BCL2-Like 1 (Long Form) 


Membrane Protein / 
Apoptosis 


BCL3 


BLC4, B-cell 

leukemia/lymphoma 

3 


B-Cell CLL / Lymphoma 3 


Transcriptional Regulator 


BID 


None 


BH3-Interacting Death 
Domain Agonist 


Ligand / Apoptosis 


BIK 


BIPLBP4, NBK, 
BBC1 


BCL2-Interacting Killer 


Apoptosis 


BIRC2 


APILCIAPLC- 
IAP, IAPLMIHB, 
MIHC 


Baculoviral IAP Repeat- 
Containing 2 


Apoptosis Inhibitor 
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BIRC3 


APE. C-IAP1. 
IAP2 MIHB- 
MIHC, cIAP2 


Baculoviral IAP Repeat- 
Containing 3 

V/ III Ul 11111 fL^ 


Apoptosis Inhibitor 


BIRC5 


Survivin, API4, 
EPR-1 IAP4 SVV 


Baculoviral IAP Repeat- 
Containing 5 


Apoptosis Inhibitor 


BPI 




bactericidal/permeability-increasing protein i 


BSG 


EMMPRIN, 5F7, 
CD 147 OK M6 
TCSF 


Basignin (OK Blood Group) 


Cell Signaling and 
Activation / Membrane 
Protein 


C1QA 


C10A1, Serum 
CIO 


Complement Component 1, 0 
Subcomponent, Alpha 
Polypeptide 


Complement 
Component 


C1QB 


None 


Complement Component 1, 0 
Subcomponent, Beta 
Polypeptide 


Complement 
Component 


CALCA 


- 


Calcitonin/calcitonin-related 
polypeptide, alpha 


Cell-signaling and 
activation 


CASP1 


ICE, EL-1BC, 
IL1BC, EL1BCE, 
IL IB-con vertase, 
P45 


Caspase 1 


Proteinase / Apoptosis 


CASP3 


Yama, Apopain, 
CPP32 CPP32B 
SCA-1 


Caspase 3 


Proteinase / Apoptosis 


CASP9 


APAF3, MCH6, 
ICE-LAP6 


Caspase 9 


Proteinase / Apoptosis 


CAT 


EC 1.11.1.6 


Catalase 


Enzyme: redox 


CCL2 


SCYA2. MCP1, 
HCll.MCAF, 

CF 


Chemokine (C-C Motif) 
Ligand 2 


Chemokine 


CCL3 


SCYA3. LD78- 
Alpha. MIP1A 
SIS-beta, G0S19-1 


Chemokine (C-C Motif) 
Ligand 3 


Chemokine 
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CCL5 


SCYA5, 

D17S136E, 

R ANTES, TCP228 


Chemokine (C-C Motif) 
Ligand 5 


Chemokine 


ecu 


MCP-3, NC28, FIC, 
MARC SCYA6, 
SCYA7 


Chemokine ( C-C Motif) 
Ligand 7 


Chemokine 


CCL8 


MCP-2, MCP2, 
HC14. SCYA8, 
SCYA10 


Chemokine (C-C Motif) 
Ligand 8 


Chemokine 


CCNA2 


CCN1.CCNA, 
Cvclin A 


Cvclin A2 


Cvclin 


CCNB1 


CCNB, G^/mitotic- 
soecific cvclin B 1 


Cvclin Bl 


Cvclin 


CCND1 


BCLLPRADL 
d/S -specific cvclin 
Dl 


Cvclin Dl 


Cvclin 


CCND3 


Gi/S -specific cvclin 
D3 D3-tVDe cvclin 


Cvclin D3 


Cvclin 


CCNE1 


Cvclin E, CCNE 


Cvclin El 


Cvclin 


CCR1 


CC-CKR-1, 
CMKRl,MIPlaR, 

T> A TVTTT3 C D 

SCYAR1 


Chemokine f C-C motif) 
Receptor 1 


Chemokine Receptor 


CCR3 


CC-CKR-3, 


Chemokine (C-C motif) 


Chemokine Receptor 


CMKBR3, CKR3, 
Eotaxin receptor 


Receptor 3 




CCR5 


CKR-5, CKR5, 
chemrl3, CC-CKR- 
5 CMKBR5 


Chemokine (C-C motif) 
Receptor 5 


Chemokine Receptor 


CD3Z 


CD3-Zeta, CD3H, 
CD30, T3Z, TCRZ 


CD3 Antigen, Zeta 
Polypeptide 


Cell Marker 


CD4 


p55, T-cell antigen 


CD4 Antigen 


Cell Marker 


T4/leu3 






CD8A 


CD8, LEU2, MAL, 
p32, CD8 T-cell 
antigen LEU2 


CD8 Antigen, Alpha 
Polypeptide 


Cell Marker 
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CD14 


LPS-R, Monocyte 
differentiation 
antigen CD 14 


CD 14 Antigen 


Cell Marker 


CD19 


LEU 12, B- 
lymphocvte antigen 
CD19 


CD 19 Antigen 


Cell Marker / Growth 
Factor 


CD22 


BL-CAM, Leu- 14, 
SIGLEC2 


CD22 Antigen 


Cell Marker 


CD34 


Hematopoietic 
progenitor cell 
antigen, HPCA1 


CD34 Antigen 


Cell Marker 


CD44 


CD44R, IN, MC56, 
MDU2, MDU3, 
M1U4, rgpl, LrlK 


CD44 Antigen 


Cell Marker / Receptor 


CD48 


BCMl, BLAST, 
Lvmphocvte 
antigen, MiiM-iUz, 
BLAST 1 


CD48 Antigen 


Cell Marker 


CD68 


Macrosialin, 
GP110, SCARD1 


CD68 Antigen 


Cell Marker 


CD69 


AIM. BL-AC/P26. 
EALGP32/28. 
Leu-23. MLR-3 


CD69 Antigen (p60. Earlv T- 
Cell Activation Antigen) 


Receptor / Cell 
Signaling and 
Activation 


CD86 


B7-2 


CD86 Antigen (CD28 antigen 
ligand 


Cell signaling and 
activation 


CDH1 


ECAD, UVO, Arc- 
1, CDHE, LCAM, 
Uvomorulin 


Cadherin L Tvpe L E- 
Cadherin (Epithelial) 


Cell Adhesion 


CDH2 


NCAD, CDHN 


Cadherin 2, Type 1 , N- 
Cadherin (Neuronal) 


Cell Adhesion 


CDK2 


Cell division kinase 
2, p33 


Cvclin-Dependent Kinase 2 


Enzvme: kinase 


CDK4 


PSK-J3, 

MGC 14458, Cell 
division kinase 4 


Cvchn-Dependent Kinase 4 


Enzvme: kinase 


CDKN1A 


p2r atl , CAP20, 


Cvclin-Dependent Kinase 


Tumor Suppressor / 


CIP1, SDILMDA- 


Inhibitor 1A 


Cell Cvcle Control 


6, MDA6, PIC1 
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CDKN2A 


d16.MTS1.INK4. 
ARF, CDK4I, 
CDKN2. CMM2 


Cvclin-Dependent Kinase 
Inhibitor 2A 


Tumor Suppressor / 
Cell Cycle Control 


CDKN2B 


Dl5.MTS2.TP15. 
Dl4-INK4b 


Cvclin-Dependent Kinase 
Inhibitor 2B 


Tumor Suppressor / 
Cell Cycle Control 


CHEK1 


CHKl.Cell cycle 
checkpoint kinase 


CHK1 (Checkpoint, S. pombe) 
Homologue 


Enzyme: kinase 


CLDN14 


DFNB29. Deafness, 
autosomal recessive 
29 


Claudin 14 


Cell Adhesion 


COL1A1 


Procollagen 


Collagen, Tvpe 1, Alpha 1 


Extracellular Matrix / 
Tissue Remodeling 

-1. IWlilV/Viwllll^ 


COL7A1 


EBD1.EBR1. 
EBDCT 


Collagen, Tvpe VII, Alpha 1 


Extracellular Matrix 


CRABP2 


RBP6, CRABP-II. 
CRABPII 


Cellular Retinoic Acid Binding 
Protein 2 


Transcriptional 
Regulator / Small 
Molecule Binding 
Protein 


CREB3 


LZIP. MGC 15333. 
MGC 19782 


cAMP Responsive Element 
Bindine Protein 3 fLuman^ 

1 J 111 V_* ± M. M. X A V/ twill \UU111U11 / 


Transcription Factor 


CRP 


PTX1 


C-Reactive Protein, Pentraxin 
Related 


Acute Phase Protein 


CSF2 


GM-CSF 


Colony Stimulating Factor 2 
( Granulocyte-Monocyte) 


Cytokine / Growth 
Factor 


CSF3 


G-CSF 


Colony Stimulating Factor 3 
(Granulocyte) 


Cytokine / Growth 
Factor 


CTGF 


NOV2. IGFBP8, 


Connective Tissue Growth 


Growth Factor / Wound 


HCS24, CCN2, 
IGFBPR2 


Factor 


Response 


CTLA4 


CD 152 


Cytotoxic T-Lymphocyte 
Associated Protein 4 


Membrane Protein / 
Immune Response 


CTNNA1 


Cadherin-associated 
Drotein, CAP 102 


Catenin, Alpha 1 


Cell Adhesion / Cell 
Structure i 
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CTSB 


APPS, CPSB, APP 
secretase 


Cathepsin B 


Proteinase: thiol 


CTSZ 


CTSX 


Cathepsin Z 


Proteinase: thiol 


CX3CR1 


CCRL1, CMK- 
BRL-L CMKDRL 
GPR13, GPRV28 


Chemokine (C-X3-C) Receptor 
I 


Chemokine Receptor 


CXCL1 


GROL SCYBL 
GROA, MGSA, 
MGSA-a, NAP-3 


Chemokine (C-X-C Motif) 
Ligand 1 


Chemokine / Growth 
Factor / Oncogene 


CXCL2 


GR02, SCYB2, 
MIP2A, GROB, 

OKUD 


Chemokine (C-X-C Motif) 
Ligand 2 


Chemokine / Growth 
Factor , 


pCXCLl_2 


Primate 

(Cvnomologous 
Monkey) Gene, 
GROl; GR02 


Chemokine (C-X-C Motif) 
Ligand 1: Chemokine (C-X-C 
Motir) Ligand Z 


Chemokine / Growth 
Factor 


CXCL10 


SCYB10, IP10, 
IFI10, CRG-2, 
MOB-1 


Chemokine (C-X-C Motif) 
Ligand 10 


Chemokine 


CXCL12 


SCYB12. SDFL 
PBSF, TPAR1, 
hIRH 


Chemokine (C-X-C Motif) 
Ligand 12 


Chemokine 


CXCR3 


GPR9, CD183, 
CKR-L2, IP10-R. 

JVLlg-K, MlgK, Ir lu 


Chemokine (C-X-C Motif) 
Receptor 3 


Chemokine Receptor 


CXCR4 


Fusin, HSY3RR, 
HM89. LAP3. 
LCR1, LESTR, 
NPYRL 


Chemokine (C-X-C Motif) 
Receptor 4 


Chemokine Receptor 


CYBB 


NOX2, CGD, 

GP91-PHOX, 
riPQ 1 duhy nPQ i 

UryirnUA, ur7 1- 

l 


Cytochrome B-245 Beta 
Polypeptide 


Enzyme: redox 


CYP1A1 


CYPl. AHRR, 
AHH. P450DX, Pl- 
450.P450-C. CPU 


Cytochrome P450, Subfamily I 
(Aromatic Compound- 
Inducible), Polypeptide 1 


Enzyme: metabolism 


CYP1A2 


CP12. P3-450, 


Cytochrome P450, Subfamily I 


Enzyme: metabolism 


P450(PA) 


(Aromatic Compound- 
Inducible), Polypeptide 2 
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CYP2C19 


CYP2C, P450C2C, 
P450HC19, CPCJ 


Cytochrome P450, Subfamily 
IIC (Mephenvtoin 4- 
nyaroxyiasej, roiypepuae iy 


Enzyme: metabolism 


CYP2D6 


P450-DB1,CPD6, 
CYP2D6L, 

rVDOHT 1 

P450C2D 


Cytochrome P450, Subfamily 
IE) (Debrisoquine, Sparteine, 
etc., -MetaDOiizins), 
Polypeptide 6 


Enzyme: metabolism 


CYP2E1 


CYP2E, P450C2E, 
CPE1, P450-J 


Cytochrome P450, Subfamily 
TTR (Ethanol-Inducible), 
Polypeptide 1 


Enzyme: metabolism 


CYP3A4 


P450C3, 

P450PCN1, CP34, 
NF-25 


Cytochrome P450, Subfamily 
III A (Niphedipine Oxidase), 
Polypeptide 4 


Enzyme: metabolism 


DAD1 


DAD-1 


Defender Against Cell Death 1 


Enzyme / Apoptosis 
Inhibitor 


DC13* 


*Not a HUGO gene 
symbol* 


DC 13 Protein 


Not Classified 


DCN 


DSPG2, PG-S2, 
PG40, PGH, 
Proteoglycan II, 
PGS2 


Decorin 


Extracellular Matrix 


DFFB 


CAD, CPAN. 
DFF2, DFF40 


DNA Fragmentation Factor, 40 
kD, Beta Polypeptide 
(Caspase- Activated Dnase) 


Enzyme: nuclease 


DPP4 


CD26, ADCP2, 
DPPIV 


Dipeptidylpeptidase IV 


Proteinase 


DSG1 




Desmoglein 1 


Membrane protein 


DTR 


DTS, HB-EGF, 
HBEGF, HEGFL 


Diphtheria Toxin Receptor 


Cell Signaling / 
Mitogen 


DUSP1 


CL100, PTPN10, 


Dual Specificity Phosphatase 


Enzyme: phosphatase 




HVH1.MKP-1, 
VH1.MKP1 


i 




ECE1 


ECE, ECE-1 


Endothelin Converting 
Enzyme 1 


Metalloproteinase 
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EDN1 


ET1 


Endothelin I 


Peptide Hormone 


EDR2 


- 


Early Development Regulator 
2 


Transcription Factor 


EGRl 


NGF1A, KROX-24, 
TIS8, ZIF-268, 
LrUoJU, INLrrlA, 
ZNF225 


Early Growth Response l 


Transcription Factor 


ELA2 


Medullasin, NE, 
SERP1 PMN 
elastase 


Elastase 2, Neutrophil 


Proteinase: serine 


EPHX1 


EPHX, MEH. EH, 
EPOX, FHS 


Epoxide Hydrolase 1, 
Microsomal (Xenobiotic) 


Enzyme: metabolism 


ERBB2 


C-erbB-2, HER2, 
NEU, NGL, 
pl85erbB2, TKR1 
Herstatin 


V-erb-b2 Erythroblastic 
Leukemia Viral Oncogene 
Homolog 2 


Oncogene / Receptor 


ERBB3 


HER3, c-erbB3, 
Oncogene ERBB3 


V-erb-b2 Erythroblastic 
Leukemia Viral Oncogene 
Homolog 3 


Oncogene / Receptor 


ESR1 


ER, ER-alDha. ESR. 
ESRA, Era, NR3A1 


Estrogen Receptor 1 


Receptor / 
Transcription Factor 


F3 


Thromboplastin, 
TFA. CD142, 
Factor HI 

(coagulation), HTF 


Coagulation Factor III 


Enzyme: redox 


FADD 


MORTl, 
MGC8528, 
Mediator of 
receptor-induced 
toxicity 


Fas (TNFRSF6)- Associated 
Via Death Domain 


Apoptosis / Co- 
Receptor 


FAP 


Seprase, FAP A 


Fibroblast Activation Protein, 
Alpha Subunit 


Liver Health Indicator / 
Proteinase 


FCGR1A 


_ 


Fc fragment of IgG, high 
affinity receptor 




FGF2 


BFGF, FGFB. 
HBGF-2, HBGH-2. 
Prostatropin 


Fibroblast Growth Factor 2 
(Basic) 


Growth Factor 
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FGF7 


KGF, Keratinocvte 
Growth Factor 


Fibroblast Growth Factor 7 


Growth Factor 


FGF18 


zFGF5 


Fibroblast Growth Factor 18 


Growth Factor 


FLT1 


VEGFR1. FRT. 
FLT 


FMS-Related Tyrosine Kinase 
I 


Receptor 


FN1 


CIG. FN, LETS, 
LETS FNZ FENC 


Fibronectin 1 


Cell Adhesion 


FOLH1 


PSMA, GCP2. 
FOLH NAALAD1 
PSM 


Folate Hydrolase (Prostate- 
Snecific Membrane Antipen^ 1 


Enzyme: hydrolase 


FOS 


Oncogene FOS, 
G0S7 


V-fos FBJ Murine 
Osteosarcoma Viral Oncogene 
Homolog 


Oncogene / 
Transcrintion Factor 


FTL 




Ferritin, light peptide 




GADD45A 


GADD45, DDIT1 


Growth Arrest and DNA 
Damage Inducible Alnha 


Cell Cvcle Control / 
Anontosis 


GCG 


GLP1, GLP2, 
GPvPP Glicentin- 

V-J AVI A ^ VillVvllllll 

related polypeptide 


Glucagon 


Peptide Hormone 


GCGR 


GGR, GL-R 


Glucagon Receptor 


Receptor 


GCK 


Hexokinase 4, GK, 
GLK, HK4, HKTV, 
HXKP, MODY2 


Glucokinase 


Enzyme: kinase 


GCLC 


GCS, GLCLC, 
GLCL 


Glutamate-Cvsteine Ligase, 
Catalytic Subunit 


Enzyme: ligase 


GFPT1 


GFA, GFAT, 
GFAT1, GFPT 


Glutamine-Fructose-6- 
Phosphate Transaminase 1 


Enzyme: 
amidotransferase 








GGT1 


Glutamyl 


Gamma-Glutamyltransferase 1 


Liver Health Indicator 


transpeptidase, 
GTG, GGT, CD224 
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GJA1 


CX43. Connexin 
43, DFNB38, Heart 
connexin 


Gap Junction Protein, Alpha 1, 
43kD 


Channel Protein / 
Transporter 


GSR 


GR. GRASE, 
GLUR, GRD1 


Glutathione Reductase 


Enzyme: redox 


GST 


Primers/probe set 
nonspecific for all 
members 01 Cjo 1 
family 


Glutathione S-Transferase 
Class 


Enzyme: metabolism 


GSTA1.2 


GST-epsilon, 
GSTA1-1 GTH1' 
GST-gamma, GST2 


Glutathione S-Transferase Al; 
Glutathione S-Transferase A2 


Enzymes: metabolism 


GSTM1 


GST1, GSTM1-1, 
GTH4 GSTMla- 
la, GSTMlb-lb 


Glutathione S-Transferase Ml 


Enzyme: metabolism 


GSTT1 


Glutathione 
transferase Tl - 1 


Glutathione-S -Transferase 
Theta 1 


Enzyme: metabolism 


GYS1 


GYS 


Glycogen Synthase 1 (Muscle) 


Enzyme: glycogen 
metabolism 


GZMB 


CTLA1, Granzvme 
2,C11,CCPI, 
CGL-1, CSP-B, 
GRB 


Granzyme B 


Proteinase 


HIF1A 


MOP1, ARNT 
Interacting Protein 


Hypoxia-Inducible Factor 1, 
Alnha Subunit 

A. 11|711U VJ \A- \J Villi I, 


Transcription Factor 


HK2 


HKII, HXK2, 
Muscle form 
Hexokinase 


Hexokinase 2 


Enzyme: hexokinase 


HLA-DRB1 


HLA class II 
histocompatibility 
antigen, DK-1 beta 
chain 


Major Histocompatibility 
Complex, Class n, DR Beta 1 


Histocompatibility 


HMGA1 


HMGIY, HMG- 
I/HMG-Y, HMG- 
I/Y, HMG-R 


High-Mobility Group AT- 
Hook 1 


Transcriptional 
Regulator / Oncogene 


HMGB1 


HMG1, HMG3, 
Amphoterin 


High-Mobility Group Box 1 


Transcriptional 
Regulator 
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HMGIY 




High mobility group protein, 
isoforms I and Y 


DNA binding - 
transcriptional 
regulation - oncogene 


HMOX1 


HOI. HO. HO-1. 
bK286B10, Heat 
shock protein 32K 


Heme Oxygenase (Decycling) 
I 


Enzyme: redox 


HSPA1A 


HSP-70, HSP70-1 


Heat Shock Protein 1 A, 70kD 


Cell Signaling and 
Activation 


HUPAP* 


*Not a HUGO eene 
symbol* 


Human Prostate- Associated 
Proteinase 


Proteinase 


ICAM1 


CD54, BB2, Human 
rhinovirus receptor 


Intercellular Adhesion 
Molecule 1 


Cell Adhesion / Matrix 
Protein 


ICOS 


AILIM 


Inducible T-Cell Co-Stimulator 


Receptor 


IFI16 


IFNGIP1 


Interferon, Gamma-Inducible 
Protein 16 


Transcriptional 
Repressor 


IFNA2_8_10 


LeBF-A; LeiF-B; 
LelF-C 


Interferon, Alpha 2; Interferon, 
Alpha 8* Interferon Alnha 10 


Cytokines / Apoptosis 


IFNG 


IFG. IFI. IFN-g 


Interferon, Gamma 


Cytokine 


IGF1R 


JTK13 


Insulin-Like Growth Factor 1 
Receptor 


Receptor / Enzyme: 
kinase 


IGFBP3 


IBP3, IGFBP-3, 
IBP-3, IGF-binding 

LJl \J I. Will 


Insulin-Like Growth Factor 
Binding Protein 3 


IGF-Binding Protein 


ILIA 


ILla,ILl-AlDha, 
IL-1A, IL1, IL1F1. 
Hematopoietin-1 


Interleukin 1, Alpha 


Cytokine 


IL1B 


ILlb, ILl-Beta, IL- 


Interleukin 1, Beta 


Cytokine 


1B. IL1F2. 






IL1R1 


IL1R1RA, IL1RA, 
p80, IL-1R-LIL- 
lR-alpha, IL1R, 
CD12 


Interleukin 1 Receptor, Type I 


Receptor / Cell 
Signaling and 
Activation 
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IL1RN 


ICIL-1RA,IL1F3, 
IL-1RA, IRAP. IL- 
1RN, IL1RA 


Interleukin 1 Receptor 
Antagonist 


Cytokine Antagonist 


IL2 


TCGF 


Interleukin 2 


Cytokine / Growth 
Factor 


IL2RA 


IL2R, P55, TCGFR, 
CD25 TAC antigen 


Interleukin 2 Receptor, Alpha 


Receptor / Cell 
Signaling and 
Activation 


IL4 


BSF1 


Interleukin 4 


Cytokine / Growth 
Factor 


IL5 


EDF, Eosinophil 
differentiation 
factor, TRF, E-CSF 


Interleukin 5 


Cytokine / Growth 
Factor 


IL6 


Interferon beta 2, 
IFNB2 BSF2 HSF 


Interleukin 6 


Cytokine / Growth 
Factor 


IL7 


IL-7 


Interleukin 7 


Cytokine / Growth 
Factor 


IL7R 


IL-7R-alnha, 
CD 127 antigen, 
CD W 127, p90 1L- 
7R 


Interleukin 7 Receptor 


Receptor 


IL8 


CXCL8, SCYB8, 
MDNCF 


Interleukin 8 


Chemokine 


IL10 


CSIF,IL-10,TGIF, 
Cytokine synthesis 

inhihitnrv faptnr 


Interleukin 10 


Cytokine 


IL12B 


IL12p40, NKSF2, 
CLMF, CLMF2, 
r>40 


Interleukin 12 <p40) 


Cytokine / Growth 
Factor 


IL13 


IL-13,NC30, d600 


Interleukin 13 


Cytokine 








IL15 


IL-2-like cytokine 


Interleukin 15 


Cytokine / Growth 






Factor 


ILI7 


CTLA8, IL-17,IL- 
17A, IL17A 


Interleukin 17 


Cytokine 
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IL18 


IGIF, IL-18, IL-lg, 
IL1F4, Interleukin- 

is 


Interleukin 18 


Cytokine / Growth 
Factor 


IL18BP 


IL18BPa, IL18BPb, 
IL18BPc 


Interleukin- 18 Binding Protein 


Cytokine Inhibitor 


IL18R1 




Interleukin 18 receptor 1 


Membrane protein 


INS 


None 


Insulin 


Hormone 


IRF5 


IRF-5 


Interferon Regulatory Factor 5 


Transcription Factor 


IRS1 


HIRS-1 


Insulin Receptor Substrate 1 


Signal Transduction 


ITGAM 


CD11B 


Integrin, alpha M; complement 
receptor 


Integrin 


IVL 


None 


Involucrin 


Cell Structure 


JUN 


CJUN, Proto- 
oncogene c-Jun, 
AP-1, API 


V-iun Avian Sarcoma Virus 17 
Oncogene Homolog 


Transcription Factor 


KAI1 


SAR2, CD82, ST6, 
R2, IA4, 4F9, C33. 
GR15 


Kangai 1 


Tumor Suppressor 


K-ALPHA- 
1* 


*Not a HUGO eene 
symbol* 


Tubulin, Alpha, Ubiquitous 


Cell Structure 


KITLG 


MGF (formerly). 
Stem cell factor 

(oLr), or, JVL.-1, 

KJTL 


KIT Ligand 


Liver Health Indicator / 
Growth Factor 


KLK1 


Tissue kallikrein, 
KLKR, Kallikrein 
serine protease 1 


Kallikrein 1 (Renal / Pancreas / 
Salivary) 


Proteinase 


KLK2 


hGK-1, Glandular 


Kallikrein 2 (Prostatic) 


Proteinase 


kallikrein 
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KLK3 


PSA. APS. KLKB1, 

Semeno?elase 

Seminin 


Kallikrein 3 (Prostate Specific 
Antigen) 


Proteinase 


KRT5 


EBS2. CK5. K5. 
Cvtokeratin 5 


Keratin 5 


Cell Structure 


KRT8 


K8, CK8, CARD2, 
CYK8, Cvtokeratin 
8 


Keratin 8 


Cell Structure 


KRT14 


EBS3, EBS4, 
CK14, K14 
Cvtokeratin 14 


Keratin 14 


Cell Structure 


KRT16 


K16. NEPPK. 
CK16 Cvtokeratin 
16 


Keratin 16 


Cell Structure 


KRT19 


K19. Keratin (tvpe 
D 40-kD 


Keratin 19 


Cell Structure 


LBP 




Lipopolvsaccharide binding 
nrotein 


Membrane protein 


LGALS3 


LGALS2, Galectin 
3 GALBP MAC2 


Lectin, Galactoside-Binding, 
Soluble 3 


Liver Health Indicator 


LGALS8 


PCTA-1. Galectin 
8 PCTA1 Po66- 
CBP 


Lectin, Galactoside-Binding, 
Soluble 8 


Cell Adhesion / Growth 
and Differentiation 


LTA 


TNFSF1, Tumor 
necrosis factor beta 
(iormerlv), IiNrr* 


Lvmphotoxin, Alpha 


Cvtokine 


MADD 


DENN, IG20, 
Insulinoma- 
glucagonoma 
protein 20 


MAP-Kinase Activating Death 
Domain 


Co-Receptor 


MAP3K1 


MAPKKK1, 
MEKK1, MEKK, 
MAP/ERK kinase 
kinase 1 


Mitogen- Activated Protein 
Kinase Kinase Kinase 1 


Enzvme: kinase 


MAP3K14 


NF-kB Inducing 
Kinase, NIK, 
HSNIK, FTDCR1B. 
HS 


Mitogen-Activated Protein 
Kinase Kinase Kinase 14 


Enzvme: kinase 
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MAPK1 


ERK2, ERK, ERT1, 
MAPK2. PRKM1, 

n OQ r\Af\ r\A 1 

Pjo, p4U, p41 


Mitogen- Activated Protein 
Kinase 1 


Enzyme: kinase 


MAPK8 


JNK1.JNK. 
SAPK1, PRKM8, 

TXTT^I AO 
JIN Jxl AZ, 

JNK21B1/2 


Mitogen- Activated Protein 
Kinase 8 


Enzyme: kinase 


MAPK14 


CSBP. CSBP1. d38, 
Mxi2, PRKM14, 
PRKM15 


Mitogen- Activated Protein 
Kinase 14 


Enzyme: kinase 


MDM2 


HDM2, 
Oncoprotein 
MDM2, d53- 
binding protein 
MDM2 


MDM2, Transformed 3T3 Cell 
Double Minute 2, p53 Binding 
Protein (Mouse) 


Transcription Factor / 
Oncogene 


MHC2TA 


C2TA, CIITA 


MHC Class II Transactivator 


DNA-Binding Protein / 
Activator 


MIF 




Macrophage migration 
inhibitory factor 


Cell signaling and 
growth factor 


MMP1 


Collagenase, CLG, 
CLGN, Fibroblast 
collagenase 


Matrix Metalloproteinase 1 


Metalloproteinase 


MMP2 


Gelatinase, CLG4A, 
CLG4, TBE-L 
oeiatinase A 


Matrix Metalloproteinase 2 


Metalloproteinase 


MMP3 


Stromelysin, 
STMY1, STMY, 

Transin-1 


Matrix Metalloproteinase 3 


Metalloproteinase 


MMP8 


Neutrophil 
collagenase, CLG1, 

Lrvrpi P1V/TNJT CI 


Matrix Metalloproteinase 8 


Metalloproteinase 


MMP9 


Gelatinase B, 
CLG4B, GELB, 
ivi dcropna ge 
gelatinase 


Matrix Metalloproteinase 9 


Metalloproteinase 


MMP12 


Macrophage 
elastase, HME, 
MME 


Matrix Metalloproteinase 12 


Metalloproteinase 


MMP13 


Collagenase 3, 
CLG3 


Matrix Metalloproteinase 13 


Metalloproteinase 
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MMP15 


MT2-MMP, MMP- 
15. SMCP-2. 

A ATI A A AAT> 

MlzMMr, 

MTMMP2 


Matrix Metalloproteinase 15 
(Membrane-Inserted) 


Metalloproteinase 


MMP19 


MMP18 (formerly), 
RASI-l.RASI 


Matrix Metalloproteinase 19 


Metalloproteinase 


MNDA 


None 


Myeloid Cell Nuclear 
Differentiation Antigen 


Cell Signaling and 
Activation 


MP1 




Metalloprotease 1 


Proteinase / Proteinase 
Inhibitor 


MPO 


B.C. 1.11.1.7 


Myeloperoxidase 


Enzyme: redox 


MRE11A 


MRE11, ATLD, 
HNGS1, MRE11B 


Meiotic Recombination (S. 
cerevisiae) 1 1 Homolog A 


Enzyme: nuclease 


MYC 


c-MYC, Oncogene 
MYC 


V-myc Avian 
Myelocvtomatosis Viral 
Oncogene Homolog 


Transcription Factor / 
Oncogene 


N33* 


D8S1992, N33 
protein, *Not a 

XJTT/T1 rwa-na. 

riULrU gene 
symbol* 


Putative Prostate Cancer 
Tumor Suppressor 


Tumor Suppressor 


NFKB1 


KBF1.EBP-1, 
NFKB d50 


Nuclear Factor of Kappa Light 
Polypeptide Gene Enhancer in 
B-Cells 1 (pl05) 


Transcription Factor 


NFKBIB 


TRIP9, IKBB, 
Thyroid hormone 
receptor interactor y 


Nuclear Factor of Kappa Light 
Polypeptide Gene Enhancer in 

TD /""'olio TnViiKit/^r- T5oto 

jd-lxus inniDiior, Beta 


Transcriptional 
Regulator 


NOS1 


NOS, N-NOS, 
NNOS, Neuronal 
inuo, constitutive 
NOS 


Nitric Oxide Synthase 1 
(Neuronal) 


Enzyme: redox 


NOS2A 


iNOS, NOS2 


Nitric Oxide Synthase 2A 
(Inducible) 


Enzyme: redox 


NOS3 


eNOS,cNOS, 
ECNOS 


Nitric Oxide Synthase 3 
(Endothelial) 


Enzyme: redox 
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NR1I2 


PXR, PAR2. 


Nuclear Receptor Subfamily 1, 


Receptor / 


Pregnane X 
Receptor, PARq, 
PRR. SXR 


Group I, Family 2 


Transcription Factor 


NR1I3 


Constitutive 
androstane recentor 
beta. CAR, MB67 


Nuclear Receptor Subfamily 1, 
Grouo T Familv 3 

V_J 1 \J 1*4 yj -1 ^ X C-ll 1111 y »^ 


Receptor / 

Tran serin ti on Factor i 


NRP1 


NRP, VEGF165R 


Neuropilin 1 


Receptor 


ORM1 


A1AGP, Aloha 1 
acid plvconrotein 
AGP-A. ORM 


Orosomucoid 1 


Liver Health Indicator 


OXCT 


OXCT1, SCOT 


3-Oxoacid CoA Transferase 


Enzyme: transferase 


PARTI* 


DKFZP586D0823: 
*Not a HUGO eene 
symbol* 


Prostate Androgen-Regulated 
Transcrint 1 


Unclassified 


PCA3 


Prostate-specific 
gene DD3 DD3 


Prostate Cancer Antigen 3 


Unclassified 


PCANAP7 


IPCA7, IPCA-7 


Prostate Cancer Associated 
Protein 7 


Unclassified 


PCGEM1* 


*Not a HUGO gene 
svmbol* 


Prostate-Specific Non-Coding 
Gene 


'Riboregulator' 


PCK1 


PEPCK-C, 
PEPCK 1 Cvtosolic 
PEPCK 


Phosphoenolpyruvate 
Carboxvkinase 1 


Enzyme: lyase 


PCNA 


DNA polymerase 
delta auxiliary 
protein 


Proliferating Cell Nuclear 
Antigen 


DNA-Binding Protein 


PCTK1 


PCTAIREL 
PCTGAIRE 


PCTAIRE Protein Kinase 1 


Enzyme: kinase 


PDCD8 


AIF, Apoptosis- 
Inducing Factor 


Programmed Cell Death 8 


Enzyme: redox 


PDEF* 


*Not a HUGO gene 


Prostate Epithelium Specific 


Transcription Factor 


symbol* 


Ets Transcription Factor 
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PF4 


CXCL4, SCYB4 


Platelet Factor 4 CChemokine 
(C-X-C Motif) Lieand 4) 

V x. ^— ' ITlVlll / X^Xf^CXXXKX / 


Chemokine 


PHC2 


EDR2, Earlv 
develonment 
regulator 2, HPH2 


Polvhomeotic-Like 2 
(Drosonhila") 


Inhibitor or Repressor 


PI3 


SKALP, ELAFIN, 
ESI, Elastase- 
specific inhibitor 


Proteinase Inhibitor 3 (Skin 
Derived) 


Proteinase Inhibitor 


PIK3R1 


GRB1, PI3K, 
PI3KA Ptdins-3- 
kinase p85-alpha 


Phosphoinosi tide-3 -Kinase , 
Resmlatorv Subunit 
Polypeptide 1 


Regulatory Adaptor 
Protein 

X 1 vlvlll 


PITRM1 


MPLhMPl, 
KIAA1 104 

X^Xl V-iX. A A V~ 


Pitrilvsin Metalloproteinase 1 


Proteinase 


PLA2G2A 


PLA2B, PLA2L, 
MOM1 RASFA 
NPS-PLA2 


Phospholipase A2, Group IIA 


Enzyme: phospholipase 


PLA2G7 


PAFAH, LDL- 
PLA2 

A Av/iZ< 


Phospholipase A2, Group VII 


Enzyme: hydrolase 


PLAT 


TPA, T-PA, 


Plasminogen Activator, Tissue 


Proteinase 


PLAU 


UPA, URK, 
PI asminoffen 

A xClOllAAJAVj^^ll 

activator (urinarv) 


Plasminogen Activator, 
Urokinase 

KJ X VJVAlJltlOV' 


Proteinase 


PLAUR 


UPAR. CD87. 
URKR 


Plasminogen Activator, 
Urokinase Receotor 


Receptor 


PNKP 


PNK. DNA kinase 


Polynucleotide Kinase 3- 
Phosohatase 


Enzyme: phosphatase 


POV1 


R00504, PB39 


Prostate Cancer Over- 
exnressed Gene 1 


Unclassified 


PPARA 


PPAR. HPPAR, 


Peroxisome Proliferator 


Receptor / 


NR1C1 


Activated Receptor, Alpha 


Transcription Factor 








PPARG 


HUMPPARG, 
NR1C3, PPAR-g. 
PPARG3, PPARG2. 
PPARG 1 


Peroxisome Proliferator 
Activated Receptor, Gamma 


Receptor / 
Transcription Factor 
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PRFl 


Preforming protein, 
HPLH2,P1,PFP, 
Pore forming 
protein 


Perforin 1 


Channel Protein / 
Apoptosis 


PRKCB1 


PKCB 


protein kinase C, beta 1 


Enzyme: phosphorvlase 


PRTN3 


PR3. C-ANCA. 
ACPA, MBN, 
AGP7, MBT, 
Myeloblasts, p29 


Proteinase 3 


Proteinase 


PSCA 


None 


Prostate Stem Cell Antigen 


Cell Marker / Antigen 


PTEN 


Mutated in multiple 

aHvanrpH ppmppr^ 1 

BZS, MHAM, TEPI 


Phosphatase and Tensin 

l-fnmnlno' 


Enzyme / Tumor 

Siinnrp^nr 

vj UUUlVJJUl 


PTGIS 


PGIS, PTGI, CYP8, 
CYPRA1 


Prostaglandin 12 (Prostacyclin) 

^Jvntha^p 


Enzyme: isomerase 


PTGS1 


COX 1. COX- 1, 
PGG/HS PGHS1 
PTGHS 


Prostaglandin-Endoperoxide 

Sivntha^p 1 


Enzyme: 

nprox i d a <ip/d i ox vppti a «ip 

ijv>i \jsvx\j-cij\sf \XX\JJ\. y z^^/iicioi/ 


PTGS2 


COX2, COX-2, 
PGG/HS PGHS-2 I 
PHS-2. hCox-2 


Prostaglandin-Endoperoxide 


Enzyme: 

nprnxiHa^p/Hinx vcrpna^p 


PTPRC 


CD45, GP180, 
LCA B220 LY5 
T200 


Protein Tyrosine Phosphatase 
Rerentor Tvdp C 


Cell Marker / Receptor 


PTX3 


Pentaxin 3, TSG14, 
Ppntraxin 3 


Pentaxin Related Gene, 
Ranidlv Induced bv TL-lb 


Acute Phase Protein 


RAD52 


DNA repair protein 
RAD52 


RAD52 (5. cerevisiae) 
Homolop 


DNA-Binding Protein 


RBI 


OSRC, PP110, 
pl05-Rb, Rb 


Retinoblastoma 1 ( Including 
Osteosarcoma) 


Tumor Suppressor 


RXRA 


Retinoic acid 
receptor RXR-a, 
NR2B1 


Retinoid X Receptor, Alpha 


Receptor / 
Transcription Factor 
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S100A7 


PSORl.Psoriasin 1 


S100 Calcium-Binding Protein 
7 


Cell Cvcle Control / 
Small Molecule 
Binding Protein 


SAA1_2 


SAA. Serum 
amyloid A: 
Amyloid A, Serum. 
2 


Serum Amyloid Al; Serum 
Amyloid A2 


Acute Phase Proteins 


SCYA2 


MCP1 


Small inducible cytokine A2 


Cytokine/Chemokine 


SCYA3 


MIP alpha 


small inducible cytokine A3 
(MIP la} 


Chemokine 


SCYA5 


Rantes 


small inducible cytokine A5 
(RANTES) 


Chemokine 


SCYB10 


IFN inducible IP 10 


small inducible cytokine 
subfamilv B (Cvs-X-Cvs) 
member 10 


Chemokine 


SDF1 


TPAR1 


stromal cell-derived factor 1 


Chemokine 


SELE 


ELAM. CD62E. 
ELAM1 ESEL 
LECAM2 


Selectin E 


Cell Adhesion 


SELL 


CD62L, HLHRC, 
LAM1 LECAM1 
LEU8 


Selectin L 


Cell Adhesion 


SELP 


CD62P.GMP-140. 
GMRP, GRMP, 
LECAM3, PSEL 


Selectin P 


Cell Adhesion 


SERPINA1 


Alpha 1 Anti- 
proteinase, AAT, 
PI1, PL A1AT 


Serine (or Cysteine) Proteinase 
Inhibitor Clade A Member 1 


Proteinase Inhibitor 


SERPINA3 


AACT, ACT, 
Alpha- 1 - Anti- 
rhvm nlrvnsi n 


Serine (or Cysteine) Proteinase 
Inhibitor, Clade A, Member 3 


Proteinase Inhibitor 


SERPINB2 


PAL PAI-2, PAI2, 
PLANH2, 
Urokinase inhibitor 


Serine (or Cysteine) Proteinase 
Inhibitor, Clade B 
(Ovalbumin), Member 2 


Proteinase Inhibitor 


SERPINB5 


Maspin, PI5 


Serine (or Cysteine) Proteinase 


Proteinase Inhibitor / 




Inhibitor, Clade B 


Tumor Suppressor 


(Ovalbumin), Member 5 
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SERPINC1 


Antithrombin HI, 
AT3, ATHI 


Serine for Cysteine) Proteinase 
Inhibitor, Clade C 
(Antithrombin), Member 1 


Proteinase Inhibitor 


SERPINE1 


PAIl. Plasminogen 
activator inhibitor 
tvpe 1. PAIR 
PLANH1 


Serine for Cysteine) Proteinase 
Inhibitor, Clade E 
(Ovalbumin), Member 1 


Proteinase Inhibitor 


SERPING1 


C-l esterase 
inhibitor. CI NH. 
Cl-liNri, Cll, 
HAE1, HAE2 


Serine (or Cysteine) Proteinase 
Inhibitor, Clade G (CI 
Inhibitor), Member 1 
(Angioedema, Hereditary) 


Proteinase Inhibitor 


SFTPD 


PSPD 


Surfactant, pulmonary 
associated protein D 


Extracellular 
T lipoprotein 

J-/lL/UL/lVlvlll 


SLC2A4 


GLUT4, Glucose 
transporter 4 

\rl CI.11l31_/W1 Ivl 1^ 


Solute Carrier Family 2 
(Facilitated Glucose 

\ A UvlXllUlvU vJlUvWuV 

1 lallaUUl LCI J, 1V1C111UCI *+ 


Glucose Transporter 


SLC35A2 


UGT. UGAT. 
UGALT UGT1 
TTHT9 TTHTT 


Solute Carrier Family 35 

fl IDP-Galactose Transporter^ 

\KJ±*rl. VJ Cll Cl\^ lUOV A 1 d.llOLy'Wl LV^l /, 

IVICIIIUCI z 


Enzyme: metabolism 


SMAC* 


*Not a HUGO eene 


Second Mitochondria-Derived 


Antagonist / Apoptosis 


svmbol* DIABLO 


Activator of Casoase 




SOCS3 


CIS3, SSI3, SSI-3, 


Suppressor of Cytokine 


Inhibitor or Repressor 


Cish3 SOCS-3 


Sipnalinp ^ 




SOD2 


IPO-B, MnSOD, 


Superoxide Dismutase 2 


Enzyme: redox 


Tndonhenoloxidase 

■P 


( Mitochondrial) 

\ iviiivwiiv/iiux itii j 




SRP19 


Signal recognition 


Signal Recognition Particle 


RNA-Binding Protein 


particle ribosomal 


19kDa 




STAT1 


STAT91 


Signal Transducer and 


Transcription Factor 




Activator of Transcription 1 




STAT3 


APRF 


Signal Transducer and 


Transcription Factor 




Activator of Transcription 3 




STAT5A_B 


MGF, STATS, 


Signal Transducer and 


Transcription Factors 


Mammary gland 


Activator of Transcription 5A: 




factor; STAT5 


Signal Transducer and 




Activator of Transcription 5B 
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STEAP 


PRSS24, STEAP 1. 
MGC 19484 


Six Transmembrane Epithelial 
Antigen of the Prostate 


Transporter / Tumor 
Antigen 

X XXX VX J. A 


TBP 


TFIID. TF2D. 
GTF2D1, GTF2D, 
SCA17 


TATA-Box Binding Protein 


Transcription Factor 


TEK 


TIE2 


tyrosine kinase, endothelial 


Transferase Receptor 


TERT 


TCS1,EST2,TP2, 
TRT Hest2 

X. 1\ X. ^ A. IVO t^tf^ 

hTERT 


Telomerase Reverse 
Transcrintase 

x. x txx m. ij w J. x yy l. Lto 


Enzyme: reverse 
transcrintase 


TGFA 


ETGF, TGF-alpha, 
EGF-like TGF 
TGF tvpe 1 


Transforming Growth Factor, 
Alpha 


Cvtokine / Growth 
Factor / Transferase 


TGFB1 


DPD1.CED. 
HGNC 2997 TGF- 
beta. TGFB. TGF-b 


Transforming Growth Factor, 
Beta 1 


Cytokine / Growth 
Factor 


TGFB3 


TGF-b3 


Transforming Growth Factor, 
Beta 3 


Cvtokine / Growth 
Factor / Cell Sipnalin? 


TGFBR2 


- 


Transforming growth factor, 
beta receptor II 


Membrane protein 


TIE 


JTK14, TIE1 


Tyrosine Kinase with 
Immunoglobulin and 
Epidermal Growth Factor 
Homology Domains 


Receptor / Signal 
Transduction 


TIMP1 


TMP, Ervthroid 


Tissue Inhibitor of Matrix 


Proteinase Inhibitor / 


potentiating 
activity, ciajI, 
EPA, EPO, HCI 


Metalloproteinase 1 


Growth Factor 


TIMP3 


SFD, HSMRK222, 
K222TA2 


Tissue Inhibitor of Matrix 
Metalloproteinase 3 


Proteinase Inhibitor 


TLR2 


TIL4, Toll / 
Interleukin 1 
receptor-like 4 


Toll-Like Receptor 2 


Receptor / Cell 
Signaling and 
Activation 


TLR4 


hToll, TOLL 


Toll-Like Receptor 4 


Receptor / Cell 
Signaling and 
Activation 
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TLX3 


HOX11L2. RNX, 
Homeo box 1 1 -like 
2 


T-Cell Leukemia, Homeobox 3 


Transcription Factor 


TNF 


TNF-abha, TNFa, 
cachectin, DIF, 
IJNhA, INFcvbi 


Tumor Necrosis Factor, 
Member 2 


Cytokine / Growth 
Factor 


TNFRSF1A 


FPF, TNF-R, TNF- 
Rl, TNFAR, 
IJNrKl, lJNrKoU, 
p55, d55-R 


Tumor Necrosis Factor 
Receptor Superfamily, 
Member 1A 


Receptor / Cell 
Signaling and 
Activation 


TNFRSF1B 


TNFR2, p75, 
CD 120b 


Tumor Necrosis Factor 
Receotor Sunerfamilv 
Member IB 


Receptor / Cell 
Signal in 2 and 
Activation 


TNFRSF11A 


RANK, Activator of 
NF-kB, ODFR, 
PDB2 


Tumor Necrosis Factor 
Receptor Superfamily, 
Member HA 


Receptor / Cell 
Signaling and 
Activation 


TNFRSF13B 


TACI, 

Transmembrane 
Activator & CAML 
Interactor 


Tumor Necrosis Factor 
Receptor Superfamily, 
Member 13B 


Receptor / Cell ! 
Signaling and 
Activation 


TNFRSF17 


BCMA, BCM, B- 
cell maturation 
factor 


Tumor Necrosis Factor 
Receptor Superfamily, 
Member 17 


Receptor / Cell 
Signaling and 
Activation 


TNFRSF25 


TNFRSF12 
(formerly), LARD, 
TRAMP, WSL-1, 
TR3. DR3 


Tumor Necrosis Factor 
Receptor Superfamily, 
Member 25 


Receptor / Cell 
Signaling and 
Activation 


TNFSF5 


CD40L. TRAP, 


Tumor Necrosis Factor 


Cytokine 


ep39, HIGM1, 
CD 154, 1MD3, 
IGM, T-BAM 


(Ligand) Superfamily, Member 
5 




TNFSF6 


FasL, APT1LG1, 


Tumor Necrosis Factor 


Cytokine / Apoptosis 


CD95L, APTL, 
APO-1 


(Ligand) Superfamily, Member 
6 




TNFSF12 


TWEAK, AP03L, 


Tumor Necrosis Factor 


Cytokine / Apoptosis 


DR3LG 


(Ligand) Superfamily, Member 






12 


TNFSF13B 


TALL-1, BLYS, 
TALL1, BAFF, 


Tumor Necrosis Factor 
(Ligand) Superfamily, Member 


Cytokine / Growth 
Factor 


THANK, TNFSF20 


13B 




TOSO* 


*Not a HUGO gene 


Regulator of Fas-Induced 


Apoptosis Inhibitor 


symbol* 


Apoptosis 
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TP53 


d53. TRP53 


Tumor Protein 53 (Li- 
Fraumeni Syndrome) 


Transcription Factor / 
Tumor Suonressor 


TRADD 


Tumor necrosis 
factor receDtor-1- 
associated protein 


TNFRSFIA-Associated Via 
Death Domain 


Regulatory Adaptor 
Protein / Anontosis 


TRAF1 


EBI6, MGC: 10353, 
Epstein-barr virus- 
induced mRNA 6 


TNF Receptor- Associated 
Factor 1 


Regulatory Adaptor 
Protein 


TRAF2 


TNF-receptor- 
associated protein, 
MCjC:45U1z, 
TRAP3 


TNF Receptor- Associated 
Factor 2 


Regulatory Adaptor 
Protein 


TRAF3 


CD40BP,LAP1, 
CAP1 CRAF1 

V--' L LI X i V^l\i XX X ^ 

LMP1 


TNF Receptor- Associated 
Factor 3 


Signal Transduction / 
Anontosis 


TREM1 


TREM-1 


Triggering Receptor Expressed 
on Mveloid Cells 1 


Receptor / Cell 
Signal in? and 
Activation 


TXNRD1 


TXNR, TR1 


Thioredoxin Reductase 1 


Enzyme: redox 


UCP2 


SLC25A8, UCPH 


Uncoupling Protein 2 
( Mitochondrial Proton 

y^cLL i 1CI j 


Liver Health Indicator 


UGT 


- 


UDP-Glucuronosvltransferase 


Enzyme, metabolism 


VCAM1 


LI CAM, CD106. 


Vascular Cell Adhesion 


Cell Adhesion / Signal 


INCAM-100 


Molecule 1 


Transduction 


VDAC1 


PORIN, PORIN-31- 
HL, Plasmalemmal 
porin 


Voltage-Dependent Anion 
Channel 1 


Channel Protein / 
Transporter 


VEGF 


VPF, VEGF-A, 


Vascular Endothelial Growth 


Cytokine / Growth 


VEGFA, 

Va 1 1 otrnni n 


Factor 


Factor 


VWF 


F8VWF, VWD 


XT YTTM1 1 * "J— * 

Von Willebrand Factor 


Cell Adhesion / 
Coagulation Factor 


XRCC5 


CTC85, CTCBF. 


X-rav Repair Complementing 


Enzyme: helicase 


G22P2, Ku antigen 


Defective Repair in Chinese 




(80 kDa) 


Hamster Cells 5 
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